Blog - Grey Newell

SWE-bench Verified: How fail_to_pass Tests and Task Instances Work (And Why It's Broken)

March 6, 2026

How SWE-bench Verified's fail_to_pass and pass_to_pass tests and task instances actually work — and why every frontier model score is contaminated. Source code analysis.

AI Open Source Evaluation

SWE-bench Tests Run 6x Faster on ARM64 with Native Containers

March 5, 2026

SWE-bench's pre-built x86 containers run through QEMU emulation on ARM64 hosts like Apple Silicon and AWS Graviton. I built native ARM64 images and measured a 6.3x speedup on the test runner.

AI Open Source Evaluation Go

Why Code Graphs Matter for AI Agents Supermodel Engineering Blog

March 2, 2026

AI coding agents lose critical structural understanding of codebases when context compaction occurs. Code graphs provide persistent external memory—representing functions, classes, and dependencies as queryable relationships—so agents can recover context without re-reading files from scratch.

AI Developer Tools Code-graphs

Building Uncompact: Lessons from Production Supermodel Engineering Blog

February 28, 2026

How Supermodel built Uncompact—a tool that maintains a persistent code graph across Claude Code's context compaction events—and the key lessons learned shipping it to production: simplicity over detail, invisibility enables adoption, and layered verification over blind trust.

AI Developer Tools Code-graphs

The Architecture of Supermodel's Code Graph API Supermodel Engineering Blog

February 25, 2026

A look inside Supermodel's real-time code analysis API: the five-stage processing pipeline, multi-language abstraction via a unified node schema, incremental graph updates, and the sub-100ms response time requirement that shaped every design decision.

AI Developer Tools Code-graphs Architecture

Implement Event-Driven Invoice Processing for Resilient Financial Monitoring at Scale AWS Architecture Blog

May 12, 2025

How to build a Business Event Monitoring System (BEMS) on AWS that handles over 86 million daily events with near real-time visibility, cross-Region controls, and automated alerts for stuck events.

Aws Architecture Serverless Event-driven

Zero to Hero: Your Guide to Career Growth Through AWS Certifications AWS Training and Certification Blog

March 20, 2025

Learn practical strategies that helped me transform from a struggling new graduate to an AWS Solutions Architect, eventually earning the coveted golden jacket awarded to those who achieve all twelve AWS Certifications.

Aws

5 Tips for AWS Certification Exams from AWS Solutions Architects AWS Training and Certification Blog

February 20, 2023

We're both solutions architects at AWS, and between us, we hold 10 active AWS Certifications. Here are five tips AWS Solutions Architects swear by to prepare for and pass AWS Certification exams.

Aws