Question 1

What does Understudy optimize?

Accepted Answer

Understudy optimizes complete production routes for repeated LLM work: the harness, model, and supply path. That includes prompts, schemas, tool-call adapters, reasoning mode, token caps, scorers, retry policy, batching, context compaction, parsers, model choice, fine-tuned descendants, and serving path.

Question 2

Do we need to move our workflow into a hosted app?

Accepted Answer

No. The CLI, MCP server, skills, and local workbench run inside the coding agents and environments your team already uses. Hosted infrastructure is optional, and is only needed when an optimization requires cloud training or serving.

Question 3

How does Understudy know whether a smaller model is good enough?

Accepted Answer

The system turns production traces and expert review into evals. A cheaper route only replaces a frontier baseline after it satisfies the task-specific quality bar, with failures and uncertainty escalated, repaired, or converted into training data.
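As a rough illustration of that gate, here is a minimal sketch of a promotion check over scored eval cases. It is not Understudy's actual API: the `EvalResult` type, the `promotion_decision` function, and the `quality_bar` and `min_pass_rate` thresholds are all hypothetical names and values chosen for the example.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class EvalResult:
    """One eval case scored for the cheaper candidate route (hypothetical shape)."""
    case_id: str
    candidate_score: float  # scorer output in [0, 1]


def promotion_decision(
    results: List[EvalResult],
    quality_bar: float = 0.9,     # assumed per-case passing score
    min_pass_rate: float = 0.95,  # assumed share of cases that must pass
) -> Dict[str, object]:
    """Decide whether the candidate route may replace the frontier baseline."""
    if not results:
        raise ValueError("cannot decide on an empty eval set")

    # Cases below the bar are not discarded: they are the ones to escalate,
    # repair, or convert into training data.
    failures = [r.case_id for r in results if r.candidate_score < quality_bar]
    pass_rate = 1 - len(failures) / len(results)

    return {
        "promote": pass_rate >= min_pass_rate,
        "pass_rate": pass_rate,
        "failures": failures,
    }
```

In this sketch the candidate is promoted only when enough cases clear the bar, and the failing case IDs are returned so they can feed review or training rather than being dropped.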

Question 4

When should we keep using a frontier model?

Accepted Answer

Keep the frontier model where premium capability changes the outcome. For routine agentic operations, the goal is enough intelligence at the right latency and price: classify a message, choose a tool, fill structured arguments, repair malformed calls, or create signal for later optimization.
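One way to picture this split is a small router that sends routine task types to a specialist model and falls back to the frontier model otherwise. This is a sketch under stated assumptions, not Understudy's implementation: the task-type names, the `route` function, the confidence value returned by the small model, and the `min_confidence` threshold are all hypothetical.

```python
from typing import Callable, Tuple

# Hypothetical labels for the routine operations listed above.
ROUTINE_TASKS = {
    "classify_message",
    "choose_tool",
    "fill_arguments",
    "repair_call",
}


def route(
    task_type: str,
    prompt: str,
    small_model: Callable[[str], Tuple[str, float]],  # returns (answer, confidence)
    frontier_model: Callable[[str], str],
    min_confidence: float = 0.8,  # assumed escalation threshold
) -> Tuple[str, str]:
    """Return (route_name, answer) for a single request."""
    # Non-routine work goes straight to the frontier model.
    if task_type not in ROUTINE_TASKS:
        return "frontier", frontier_model(prompt)

    answer, confidence = small_model(prompt)

    # Uncertain answers are escalated instead of being returned as-is.
    if confidence < min_confidence:
        return "frontier", frontier_model(prompt)

    return "small", answer
```

The two models are passed in as plain callables so the same routing logic can be exercised with stubs in tests and with real clients in production.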

Question 5

Do we own the resulting models?

Accepted Answer

Yes. The goal is to hand off prompts, evaluators, routing rules, and specialist model weights that your team can serve on Fireworks, Bedrock, Vertex, or your own GPUs.

Question 6

What teams are a fit for private preview?

Accepted Answer

The best fit is a team with a real production LLM workload, meaningful cost or latency pressure, repeated task volume, and domain experts who can review outputs.

Don’t Use Their Models.
Use Yours.

Understudy Optimizes the Complete LLM Production Route

Capture

Evaluate

Train

Deploy

Proof of performance

Frequently asked questions

Interested?
