SALT SLA

university

http://stanford.edu/

AI & ML interests

None defined yet.

Recent Activity

john-b-yang

authored a paper about 2 months ago

CodeClash: Benchmarking Goal-Oriented Software Engineering

Paper • 2511.00839 • Published Nov 2 • 9

john-b-yang

authored 7 papers 8 months ago

SWE-smith: Scaling Data for Software Engineering Agents

Paper • 2504.21798 • Published Apr 30 • 11

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

Paper • 2310.06770 • Published Oct 10, 2023 • 9

InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback

Paper • 2306.14898 • Published Jun 26, 2023

DevBench: A Comprehensive Benchmark for Software Development

Paper • 2403.08604 • Published Mar 13, 2024 • 2

WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Paper • 2207.01206 • Published Jul 4, 2022 • 3

SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

Paper • 2405.15793 • Published May 6, 2024 • 7

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 43

WillHeld

authored 3 papers 10 months ago

Focus on what matters: Applying Discourse Coherence Theory to Cross Document Coreference

Paper • 2110.05362 • Published Oct 11, 2021

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 77

Mind the Gap! Static and Interactive Evaluations of Large Audio Models

Paper • 2502.15919 • Published Feb 21 • 4

WillHeld

authored a paper about 1 year ago

Distilling an End-to-End Voice Assistant Without Instruction Training Data

Paper • 2410.02678 • Published Oct 3, 2024 • 23

WillHeld

authored 8 papers almost 2 years ago

Can Large Language Models Transform Computational Social Science?

Paper • 2305.03514 • Published Apr 12, 2023 • 1

A Material Lens on Coloniality in NLP

Paper • 2311.08391 • Published Nov 14, 2023

Shapley Head Pruning: Identifying and Removing Interference in Multilingual Transformers

Paper • 2210.05709 • Published Oct 11, 2022

Team members 2
