import { Demo } from '@/components/Demo'
import { CodeBlock } from '@/components/CodeBlock'
import { QuickNav } from '@/components/QuickNav'
import { FrameworkCards } from '@/components/FrameworkCards'
import { MarkdownLink } from '@/components/MarkdownLink'
import {
  quickNavItems,
  langgraphFiles, crewaiFiles, autogenFiles, llamaindexFiles, adkFiles,
  envVarsHighlighted, tokenAuthHighlighted,
  valuesYamlHighlighted, deploymentYamlHighlighted, initContainerYamlHighlighted,
} from './code-examples'

<QuickNav items={quickNavItems} />

<div style={{ paddingTop: '1.5rem', paddingBottom: '5rem' }}>

# Connect to MLflow

<p className="MdSubtitle">
  Send agent traces from any framework to MLflow on Red Hat OpenShift AI.
  <MarkdownLink />
</p>

MLflow tracing captures every LLM call, tool invocation, and agent state transition
as structured spans. On OpenShift AI, MLflow runs as a managed service that your agent
connects to via environment variables — no code changes needed when moving between
standalone and operator-managed deployments.

The pattern is the same across all frameworks: read the tracking URI from the environment,
optionally authenticate with a service account token, and call the framework's
`autolog()` function. Every trace is then automatically collected,
including LLM inputs/outputs, latency, token counts, and tool results.
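The startup pattern above can be sketched as a small helper. This is an illustrative sketch, not code from the reference repo: `configure_tracing` and `default_experiment` are hypothetical names, and the `MLFLOW_TRACKING_TOKEN` hand-off assumes the standard MLflow bearer-token environment variable.

```python
import os

def configure_tracing(default_experiment: str = "my-agent") -> dict:
    """Read MLflow connection settings from the environment.

    `configure_tracing` is an illustrative name, not a framework API.
    """
    settings = {
        # Required in both modes: where to send traces.
        "tracking_uri": os.environ["MLFLOW_TRACKING_URI"],
        # Optional: defaults to the agent name.
        "experiment": os.environ.get("MLFLOW_EXPERIMENT_NAME", default_experiment),
    }
    # CR mode only: authenticate with the mounted service account token.
    token_file = os.environ.get("MLFLOW_TRACKING_TOKEN_FILE")
    if token_file:
        with open(token_file) as f:
            # MLflow's client picks up MLFLOW_TRACKING_TOKEN for bearer auth.
            os.environ["MLFLOW_TRACKING_TOKEN"] = f.read().strip()
    return settings

# At startup the agent would then call, for example:
#   import mlflow
#   settings = configure_tracing()
#   mlflow.set_tracking_uri(settings["tracking_uri"])
#   mlflow.set_experiment(settings["experiment"])
#   mlflow.langchain.autolog()  # or crewai/autogen/llama_index autolog
```

Keeping the mode differences inside one helper means the same agent image runs unchanged in standalone and CR deployments.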

<FrameworkCards />

## OpenShift AI Setup

MLflow on OpenShift AI supports two deployment modes:
**standalone** (a Deployment + Service + PVC managed by your Helm chart) and
**CR mode** (an MLflow custom resource managed by the MLflow operator). Both expose the same
tracking API — only the connection details differ.

### Environment variables

Your agent reads these environment variables at startup. In standalone mode, only
`MLFLOW_TRACKING_URI` is required. In CR mode, the workspace, token file, and CA bundle
are also needed.

<CodeBlock title="Environment variables">{envVarsHighlighted}</CodeBlock>

<div className="ApiTable">
  <table>
    <thead>
      <tr>
        <th>Variable</th>
        <th>Required</th>
        <th>Description</th>
      </tr>
    </thead>
    <tbody>
      <tr>
        <td><code className="MdCode">MLFLOW_TRACKING_URI</code></td>
        <td>Yes</td>
        <td>MLflow server URL. Set automatically by the Helm chart.</td>
      </tr>
      <tr>
        <td><code className="MdCode">MLFLOW_EXPERIMENT_NAME</code></td>
        <td>No</td>
        <td>Experiment name. Defaults to the agent name.</td>
      </tr>
      <tr>
        <td><code className="MdCode">MLFLOW_WORKSPACE</code></td>
        <td>CR only</td>
        <td>Namespace for multi-tenant isolation via the operator gateway.</td>
      </tr>
      <tr>
        <td><code className="MdCode">MLFLOW_TRACKING_TOKEN_FILE</code></td>
        <td>CR only</td>
        <td>Path to the service account token for gateway authentication.</td>
      </tr>
      <tr>
        <td><code className="MdCode">REQUESTS_CA_BUNDLE</code></td>
        <td>CR only</td>
        <td>CA bundle for TLS to the operator-managed MLflow gateway.</td>
      </tr>
    </tbody>
  </table>
</div>

### Authentication

In CR mode, the agent authenticates to the MLflow operator gateway using a Kubernetes
service account token. The token is mounted at the standard path and read at startup:

<CodeBlock title="Token file authentication">{tokenAuthHighlighted}</CodeBlock>

The operator gateway also requires a merged CA bundle (system CAs + Kubernetes service CA)
for TLS verification. This is handled by an init container in the deployment.

## Agent Frameworks

### LangGraph

[LangGraph](https://langchain-ai.github.io/langgraph/) is a widely used framework for
building stateful, multi-actor agent applications. MLflow's `mlflow.langchain.autolog()`
automatically traces all LangChain and LangGraph components — LLM calls, tool executions,
graph node transitions, and state checkpoints.

This example is from the
[bank-voice-agent](https://github.com/redhat-et/bank-voice-agent) reference architecture,
which runs a multi-agent banking assistant on OpenShift AI with full MLflow observability.

<Demo files={langgraphFiles} defaultCollapsed={true}>
  <div className="DemoPreviewText">
    <strong className="MdStrong">mlflow.langchain.autolog()</strong>
    <span> — Traces LLM calls, tool use, and graph state transitions</span>
  </div>
</Demo>

With `autolog()` enabled, every call to `graph.invoke()` or `graph.stream()` produces
a trace with spans for each node, LLM invocation, and tool call. No manual callbacks
are needed.

### CrewAI

[CrewAI](https://www.crewai.com/) orchestrates role-based AI agents working together as a crew.
MLflow's `mlflow.crewai.autolog()` captures each agent's task execution, tool calls, and
crew-level orchestration.

<Demo files={crewaiFiles} defaultCollapsed={true}>
  <div className="DemoPreviewText">
    <strong className="MdStrong">mlflow.crewai.autolog()</strong>
    <span> — Traces crew orchestration, agent tasks, and tool calls</span>
  </div>
</Demo>

### AutoGen

[AutoGen](https://microsoft.github.io/autogen/) enables multi-agent conversations where
agents collaborate, debate, and solve problems together. MLflow's `mlflow.autogen.autolog()`
traces each agent turn, message exchange, and termination condition.

<Demo files={autogenFiles} defaultCollapsed={true}>
  <div className="DemoPreviewText">
    <strong className="MdStrong">mlflow.autogen.autolog()</strong>
    <span> — Traces agent conversations, turns, and group chat flow</span>
  </div>
</Demo>

### LlamaIndex

[LlamaIndex](https://www.llamaindex.ai/) specializes in RAG pipelines and data-connected agents.
MLflow's `mlflow.llama_index.autolog()` captures document loading, embedding, retrieval, and
query engine execution.

<Demo files={llamaindexFiles} defaultCollapsed={true}>
  <div className="DemoPreviewText">
    <strong className="MdStrong">mlflow.llama_index.autolog()</strong>
    <span> — Traces RAG retrieval, embedding, and query execution</span>
  </div>
</Demo>

### Google ADK

[Google Agent Development Kit (ADK)](https://google.github.io/adk-docs/) builds agents using
Gemini models with built-in tool use. ADK uses OpenTelemetry natively — traces can be exported
to MLflow's OTLP endpoint or via the `mlflow.tracing` API.

<Demo files={adkFiles} defaultCollapsed={true}>
  <div className="DemoPreviewText">
    <strong className="MdStrong">OpenTelemetry export</strong>
    <span> — Traces agent runs, tool calls, and Gemini model interactions</span>
  </div>
</Demo>

## OpenShift Deployment

The Helm chart handles MLflow deployment and injects the correct environment variables
into your agent's pod. The configuration differs between standalone and CR mode. 

### Standalone mode

Deploys MLflow as a Deployment + Service + PVC in your namespace. The agent connects
directly via HTTP. This is the simplest setup and works on any OpenShift cluster.
See the [chart deployment template](https://github.com/eformat/bank-voice-agent/blob/main/ai-voice-agent/deploy/chart/templates/mlflow-deployment.yaml)
for a full working example.

<CodeBlock title="values.yaml">{valuesYamlHighlighted}</CodeBlock>

### CR mode (MLflow Operator)

Uses the MLflow operator to manage MLflow as a custom resource. The operator provides
a gateway that handles multi-tenant workspace isolation and service account
authentication. An init container merges CA certificates for TLS.
See the [chart CR template](https://github.com/eformat/bank-voice-agent/blob/main/ai-voice-agent/deploy/chart/templates/mlflow-cr.yaml)
for a full working example.

<CodeBlock title="backend-deployment.yaml">{deploymentYamlHighlighted}</CodeBlock>

<CodeBlock title="Init container — CA bundle merge">{initContainerYamlHighlighted}</CodeBlock>

</div>
