I'm John! Currently a second-year CS PhD student at Stanford University.
Check out john-b-yang.github.io for more.
Can Language Models Rebuild Programs From Scratch?
Benchmarking Goal-Oriented Software Engineering
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
SWE-bench: Can Language Models Resolve Real-world GitHub Issues?
The 100-line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo, but scores >74% on SWE-bench Verified!
[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents