stanfordnlp/dspy · Archaeologist

From the Field

“DSPy: The most important LM framework you're not using yet.”

Verdict:Reach for it

Reach for it when

You need to build reliable, optimizable LM pipelines beyond simple prompting.

Look elsewhere when

You just want a quick one-off chat completion or are allergic to learning a new paradigm.

In context

It's like LangChain but with actual programmatic optimization instead of just orchestration.

Complexity●●●Heavy

Read time~30 minutes

Language

Python

Runtime

Python >=3.10, <3.15

Dependencies

0total

What using it looks like

Drawn from the project's README

From the README

pip install dspy

Fig. 1 — example 1 of 2

What this is

As told for the tourist

What Is This?

DSPy is a toolkit that lets you teach AI language models (like ChatGPT) to do complex tasks by writing regular Python code instead of endlessly tweaking prompts. Think of it as a coach that helps you train an AI to follow a recipe, rather than you having to hand-write every single instruction each time.

What Can You Do With It?

You could use this to build a system that automatically answers customer support questions by searching your company's knowledge base, or create a tool that summarizes long research papers into bullet points. The README shows you can build "simple classifiers, sophisticated RAG pipelines, or Agent loops" — which just means anything from sorting emails into folders to building a research assistant that reads documents and writes reports.

Here's how simple it is to get started:

pip install dspy

Then you'd write something like:

import dspy

# Define what you want the AI to do

class AnswerQuestion(dspy.Module):

def __init__(self):

self.answer = dspy.Predict("question -> answer")

def forward(self, question):

return self.answer(question=question)

That's it — you've just created a reusable AI program.

pip install dspy

import dspy

# Define what you want the AI to do
class AnswerQuestion(dspy.Module):
    def __init__(self):
        self.answer = dspy.Predict("question -> answer")
    
    def forward(self, question):
        return self.answer(question=question)

How It Works (No Jargon)

1. Programs instead of prompts — It's like writing a recipe for a chef, rather than shouting instructions at them each time. You define the steps (search, summarize, format) once, and DSPy remembers the pattern.

2. Automatic teaching — It's like having a tutor that watches how the AI performs, then quietly adjusts the instructions to make it better. You give it examples of good outputs, and it figures out the best way to ask the AI to produce those results.

3. Self-improvement — Imagine a coach that watches your golf swing, then suggests tiny adjustments to your grip and stance. DSPy does this for AI prompts — it tries different wordings and examples, keeps what works, and throws away what doesn't.

What's Cool About It?

You stop fighting with prompts. Normally, you'd spend hours tweaking "Please answer concisely" to "Answer in 2-3 sentences" and still get inconsistent results. DSPy treats prompts like code — you write the logic once, and it optimizes the wording automatically.

It works with any AI model. Whether you're using OpenAI's GPT, an open-source model running on your laptop, or something from Anthropic, DSPy handles the differences. You write your program once, and it adapts to whatever AI engine you plug in.

Who Should Care?

Reach for this if you're building anything that uses AI to process information in multiple steps — like a research assistant, a customer service bot, or a content generator. It's perfect if you're tired of copy-pasting prompts and want something that actually improves over time.

Skip it if you just need a one-off answer from ChatGPT, or if you're building a simple chatbot that doesn't need to follow complex instructions. For basic "ask a question, get an answer" scenarios, DSPy is overkill — like using a full kitchen to make toast.

Start Here

A recommended reading path through the code

Start Here

A recommended reading path through the code

01
dspy/__init__.py
Reveals the public API surface and top-level abstractions, giving a bird's-eye view of the entire library's structure.
02
dspy/teleprompt/__init__.py
Exposes the key optimization/teleprompter classes, which are central to DSPy's core workflow of program optimization.
03
dspy/teleprompt/bootstrap.py
Implements BootstrapFewShot, a fundamental teleprompter that demonstrates the core pattern of composing demonstrations from traces.
04
dspy/teleprompt/copro_optimizer.py
Shows a more advanced optimization strategy (COPRO) for iterative instruction generation, revealing how prompts are refined.
05
dspy/clients/openai.py
Illustrates the client/provider abstraction for interacting with LLMs, a critical architectural layer for model access.