Chain-of-Causation (CoC) is Alpamayo 1’s approach to generating interpretable, step-by-step reasoning traces that explain why the model predicts a particular trajectory. Unlike black-box motion prediction models, Alpamayo 1 produces human-readable explanations alongside its driving outputs.Documentation Index
Fetch the complete documentation index at: https://mintlify.com/NVlabs/alpamayo/llms.txt
Use this file to discover all available pages before exploring further.
What is Chain-of-Causation?
Chain-of-Causation reasoning generates natural language descriptions that:- Identify relevant scene elements (pedestrians, vehicles, traffic signals, etc.)
- Analyze their causal relationships to ego vehicle behavior
- Explain the predicted trajectory in terms of those relationships
CoC traces are generated before trajectory prediction, allowing the reasoning to guide the expert model’s action predictions through shared context.
Example Chain-of-Causation
Architecture Integration
CoC reasoning is deeply integrated into Alpamayo 1’s two-stage architecture:Stage 1: VLM Reasoning Generation
The Vision-Language Model generates CoC traces through autoregressive text generation:- Sampling strategy: Nucleus sampling (top-p) with temperature scaling
- Default parameters:
top_p=0.98,temperature=0.6 - Stop condition: Generation stops at
<|traj_future_start|>token - Masking: Discrete trajectory tokens are masked out (set to
-inf) during CoC generation
Stage 2: Expert Model Uses CoC Context
The expert model reuses the VLM’s KV (key-value) cache, which contains the encoded CoC reasoning:Special Tokens for Structure
Alpamayo 1 uses special tokens to delineate different parts of the output:Typical Token Sequence
Training Data & Supervision
From the README (lines 89-92):| Feature | Paper Description | Release (v1.0) |
|---|---|---|
| Chain-of-Causation (CoC) reasoning | Hybrid auto-labeling with human in the loop for reasoning traces | ✅ Included |
- Auto-labeling: Automated generation of reasoning candidates
- Human-in-the-loop: Human reviewers validate and refine reasoning quality
- Supervision: Model is trained to generate these traces via standard language modeling loss
Extracting CoC from Model Outputs
When running inference withreturn_extra=True, CoC traces are returned separately:
Multi-sample generation: When
num_traj_samples > 1, each sample gets its own CoC trace due to stochastic sampling, providing diverse reasoning explanations.Reasoning Quality & Limitations
Strengths
- Interpretability: Human-readable explanations of driving decisions
- Scene understanding: Demonstrates awareness of relevant objects and their states
- Causal relationships: Links observations to predicted actions
- Multi-modal grounding: Reasoning is grounded in visual observations
Current Limitations
From README FAQ (lines 117-120):While the paper describes RL stages for improving reasoning quality and action consistency, this release focuses on the supervised learning components. We may release RL post-trained models in future releases.
Customizing Reasoning Generation
You can control CoC generation through sampling parameters:- Lower temperature (0.3-0.5): More focused, deterministic reasoning
- Higher temperature (0.7-1.0): More diverse, creative explanations
- top_p closer to 1.0: Include more token candidates (more diverse)
- top_p closer to 0.0: Restrict to highest-probability tokens (more deterministic)
Research Applications
Chain-of-Causation reasoning enables several research directions:- Interpretability studies: Analyze what scene elements the model attends to
- Failure analysis: Understand why the model makes mistakes
- Trust & safety: Verify that predictions are based on appropriate reasoning
- Data augmentation: Use reasoning traces for auto-labeling new data
- Human-AI collaboration: Enable human oversight of autonomous systems
Architecture Overview
Understand how CoC fits into the overall architecture
Inputs & Outputs
Detailed format of CoC outputs