logo
Railtracks Docs
Evaluation
Initializing search
    railtracks
    • Home
    • Agent Development
    • Retrieval
    • Evaluations
    • Observability
    • Integrations
    • Tutorials
    • API Reference
    railtracks
    • Home
        • Quickstart
        • Installation
        • LLM Setup
        • AI Coding Assistants
        • Overview
          • Function Tools
          • Agents as Tools
          • MCP
          • Modalities
          • Failures
        • Direct Invocation
        • Flow Invocation
          • Overview
          • Quickstart
          • Built-in Guardrails
        • Agent Context
        • Quickstart
        • Ingestion
        • Retrieval
        • Module Packages
        • Design
          • Base Class
          • Built-in Loaders
          • Base Class
          • Built-in Methods
          • Overview
          • Built-in Methods
          • Base Class
          • Backends
      • Preface
      • Quickstart
      • Visualization
        • Metrics Overview
        • Categorical Metrics
        • Numerical Metrics
        • Evaluator Abstraction
        • ToolUseEvaluator
        • LLMInferenceEvaluator
        • JudgeEvaluator
        • Local
        • Cloud
        • Overview
        • Local Chat Interface
        • Terminal Interface
        • Custom
        • Logging
        • Broadcasting
        • Error Handling
      • Overview
        • Providers
        • Platforms
        • Streaming
        • Github
        • Slack
        • Email
        • Overview
        • AWS S3
        • Azure Blob Storage
        • Google Cloud Storage
        • SQL Databases
        • Python Sandbox
        • Shell
        • Websearch
          • Building your first agent
          • Running your first agent
          • Agents as Tools
          • Prompts and Context
          • Flows
          • FastAPI Integration
          • Multiagent Systems
          • Multimodal Agent
          • AgentHub
          • Evaluation
        • Agent Architectures
        • FastAPI Integration
        • File Embedding (RAG)
          • Agents
          • Tools
          • MCP
          • Prompts and Context
          • RAG
          • Async/Await
          • Vector Stores
          • Overview
          • Validation Loops
          • Sequential Flows
    • API Reference
    1. Home
    2. Tutorials
    3. Walkthroughs
    4. Videos

    Evaluation

    Previous
    AgentHub
    Next
    Agent Architectures