Skip to content
View john-b-yang's full-sized avatar
🐶
wuphf.com
🐶
wuphf.com

Highlights

  • Pro

Organizations

@saasbook @SoftwareDefinedBuildings @61c-teach @SWE-bench

Block or report john-b-yang

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
john-b-yang/README.md

Hey there 👋

I'm John! Currently a 2nd year CS PhD student at Stanford University.

Check out john-b-yang.github.io for more.

Pinned Loading

  1. facebookresearch/ProgramBench facebookresearch/ProgramBench Public

    Can Language Models Rebuild Programs From Scratch?

    Python 814 54

  2. CodeClash-ai/CodeClash CodeClash-ai/CodeClash Public

    Benchmarking Goal-Oriented Software Engineering

    Python 176 17

  3. SWE-agent/SWE-agent SWE-agent/SWE-agent Public

    SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

    Python 19.7k 2.2k

  4. SWE-bench/SWE-bench SWE-bench/SWE-bench Public

    SWE-bench: Can Language Models Resolve Real-world Github Issues?

    Python 5.3k 906

  5. SWE-agent/mini-swe-agent SWE-agent/mini-swe-agent Public

    The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

    Python 5.5k 757

  6. SWE-bench/SWE-smith SWE-bench/SWE-smith Public

    [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

    Python 688 124