MLflow Tracing Instrumentation Guide

Name: instrumenting-with-mlflow-tracing
Rating: 87
Author: mlflow

Language-Specific Guides

Based on the user's project, load the appropriate guide:

If unclear, check for package.json (TypeScript) or requirements.txt/pyproject.toml (Python) in the project.

Trace these operations (high debugging/observability value):

Operation Type	Examples	Why Trace
Root operations	Main entry points, top-level pipelines, workflow steps	End-to-end latency, input/output logging
LLM calls	Chat completions, embeddings	Token usage, latency, prompt/response inspection
Retrieval	Vector DB queries, document fetches, search	Relevance debugging, retrieval quality
Tool/function calls	API calls, database queries, web search	External dependency monitoring, error tracking
Agent decisions	Routing, planning, tool selection	Understand agent reasoning and choices
External services	HTTP APIs, file I/O, message queues	Dependency failures, timeout tracking

Skip tracing these (too granular, adds noise):

Rule of thumb: Trace operations that are important for debugging and identifying issues in your application.

Log user feedback on traces for evaluation, debugging, and fine-tuning. Essential for identifying quality issues in production.

See references/feedback-collection.md for:

See references/production.md for:

See references/advanced-patterns.md for:

See references/distributed-tracing.md for: