
Blog
SWE-rebench dataset: More than 21,000 verifiable tasks for SWE agents
Our AI R&D team is open-sourcing the SWE-rebench dataset, which contains more than 21,000 real-world, interactive software engineering tasks. For the detailed methodology, please see our accompanying technical report on arXiv.
Kvax: Fast and easy-to-use FlashAttention implementation for JAX
Today, we’re open-sourcing Kvax, our FlashAttention implementation based on JAX. Designed for efficient training with long sequences, Kvax supports context parallelism and optimized computation of document masks. It outperforms many other FlashAttention implementations in long-context training with dense packing, achieving state-of-the-art performance.
Scaling data collection for training software engineering agents
In this follow-up to our previous research blog post, we focus on the data collection process used to train our action generator and critic models. We’re also releasing two datasets on Hugging Face: nebius/SWE-bench-extra, containing 6,411 Issue/Pull-Request pairs, and nebius/SWE-agent-trajectories, containing 80,036 trajectories in which a software engineering agent attempts to solve these issues.