ControlNet/README.md

🎓 I'm a research fellow (postdoc) working in computer vision and artificial intelligence. My current focus areas are:

  • Visual Analysis & Reasoning
  • Neuro-Symbolic AI & AI Agents
  • Deepfakes

🔎 Reviewer for CVPR, ICCV, ECCV, NeurIPS, ACM MM, ICRA, TPAMI, TMM, TAFFC, and more.

🖥️ I enjoy programming and implementing cool ideas.

🧰 I also love discovering and fine-tuning the tools I use, both software (zsh environment, syntax highlighting) and physical.

🔔 lllyasviel/ControlNet is great work and shares the same name, but it is unrelated to me.

💾 Programming Languages and Tools

📮 Contact

Pinned

  1. MARLIN (Public)

    [CVPR] MARLIN: Masked Autoencoder for facial video Representation LearnINg

    Python · 259 stars · 27 forks

  2. AV-Deepfake1M (Public)

    [ACM MM Award] AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset

    Python · 168 stars · 12 forks

  3. LAV-DF (Public)

    [CVIU, DICTA Award] Glitch in the Matrix: A Large-Scale Benchmark for Content-Driven Audio-Visual Forgery Detection and Localization

    Python · 103 stars · 18 forks

  4. HYDRA (Public)

    [ECCV] HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning

    Python · 21 stars · 5 forks

  5. NAVER (Public)

    [ICCV] NAVER: A Neuro-Symbolic Compositional Automaton for Visual Grounding with Explicit Logic Reasoning

    Python · 26 stars · 3 forks

  6. pokerme7777/Compositional-Visual-Reasoning-Survey (Public)

    Explain Before You Answer: A Survey on Compositional Visual Reasoning

    295 stars · 33 forks