Grey Newell - Evaluation Infrastructure Engineer

Building eval infrastructure for AI systems. Creator of the MIST stack. MS CS (ML) at Georgia Tech. Ex-AWS, 12x certified.

Projects

Eval framework. Define correct, test against it, get results.

21 Go Website

Route inference across LLM providers. Track cost per request.

89 Go Website

Structured data compiler. Pass pipeline, pluggable backends.

11 Go Website

Where did your tokens go? Spans, latency percentiles, alerts.

5 Go Website

Shared core for the MIST stack. Zero external deps.

1 Go

Ship evals before you ship features.

7 Markdown Website

Frequently asked questions

MIST Stack

What is the MIST stack?
What is eval-driven development?
What is MatchSpec and how does it work?
What is InferMux and how does it route inference?
What is SchemaFlux?
What is TokenTrace?
Why does the MIST stack have zero external dependencies?
How do MIST stack tools communicate?

Technical Publications & Projects

What technical articles has Grey Newell published on the AWS blog?