# YAML Pipeline Deployment
Hayhooks supports deploying Haystack pipelines directly from YAML definitions. This approach builds request/response schemas automatically from the YAML-declared `inputs` and `outputs`.
## Overview
YAML pipeline deployment is ideal for:
- Simple pipelines with clear inputs and outputs
- Quick deployment without wrapper code
- Automatic schema generation
- CI/CD pipeline deployments
!!! tip "Converting Existing Pipelines"
If you already have a Haystack `Pipeline` instance, you can serialize it with `pipeline.dumps()` and then manually add the required `inputs` and `outputs` sections before deploying.
## Requirements
### YAML Structure
Your YAML file must include both `inputs` and `outputs` sections:
```yaml
components:
  converter:
    type: haystack.components.converters.html.HTMLToDocument
    init_parameters:
      extraction_kwargs: null
  fetcher:
    init_parameters:
      raise_on_failure: true
      retry_attempts: 2
      timeout: 3
      user_agents:
        - haystack/LinkContentFetcher/2.0.0b8
    type: haystack.components.fetchers.link_content.LinkContentFetcher
  llm:
    init_parameters:
      api_key:
        env_vars:
          - OPENAI_API_KEY
        strict: true
        type: env_var
      generation_kwargs: {}
      model: gpt-4o-mini
    type: haystack.components.generators.chat.openai.OpenAIChatGenerator
  prompt:
    init_parameters:
      template: |
        {% message role="user" %}
        According to the contents of this website:
        {% for document in documents %}
        {{document.content}}
        {% endfor %}
        Answer the given question: {{query}}
        {% endmessage %}
      required_variables: "*"
    type: haystack.components.builders.chat_prompt_builder.ChatPromptBuilder
connections:
  - receiver: converter.sources
    sender: fetcher.streams
  - receiver: prompt.documents
    sender: converter.documents
  - receiver: llm.messages
    sender: prompt.prompt
inputs:
  urls: fetcher.urls
  query: prompt.query
outputs:
  replies: llm.replies
```
### Key Requirements
1. **`inputs` Section**: Maps friendly names to pipeline component fields
2. **`outputs` Section**: Maps pipeline outputs to response fields
3. **Valid Components**: All components must be properly defined
4. **Valid Connections**: All connections must reference existing components
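Before deploying, you can sanity-check these requirements with a short script. The following is a minimal sketch using PyYAML; it only verifies that the required top-level sections are present, not that the components or connections themselves are valid:

```python
import yaml

def check_pipeline_yaml(path: str) -> None:
    """Verify that a pipeline YAML has the sections Hayhooks expects."""
    with open(path) as f:
        definition = yaml.safe_load(f)
    for section in ("components", "connections", "inputs", "outputs"):
        if section not in definition:
            raise ValueError(f"missing required '{section}' section")

# Hypothetical file path; adjust to your project layout.
check_pipeline_yaml("pipelines/chat_pipeline.yml")
```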
## Deployment Methods
=== "CLI"
```bash
# Deploy with default settings
hayhooks pipeline deploy-yaml pipelines/chat_pipeline.yml
# Deploy with custom name
hayhooks pipeline deploy-yaml -n my_chat_pipeline pipelines/chat_pipeline.yml
# Deploy with description
hayhooks pipeline deploy-yaml -n my_chat_pipeline --description "Chat pipeline for Q&A" pipelines/chat_pipeline.yml
# Overwrite existing pipeline
hayhooks pipeline deploy-yaml -n my_chat_pipeline --overwrite pipelines/chat_pipeline.yml
```
=== "HTTP API"
```bash
curl -X POST \
http://localhost:1416/deploy-yaml \
-H 'Content-Type: application/json' \
-d '{
"name": "my_chat_pipeline",
"description": "Chat pipeline for Q&A",
"source_code": "...",
"overwrite": false
}'
```
=== "Python"
```python
import requests
response = requests.post(
"http://localhost:1416/deploy-yaml",
json={
"name": "my_chat_pipeline",
"description": "Chat pipeline for Q&A",
"source_code": "...", # Your YAML content as string
"overwrite": False
}
)
print(response.json())
```
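In practice you will usually read the YAML from disk rather than inlining it. A minimal sketch of the Python variant above (the file path is an assumption; the endpoint and payload fields are the ones shown in the tabs):

```python
from pathlib import Path

import requests

# Read the YAML definition from disk (hypothetical path).
yaml_source = Path("pipelines/chat_pipeline.yml").read_text()

response = requests.post(
    "http://localhost:1416/deploy-yaml",
    json={
        "name": "my_chat_pipeline",
        "description": "Chat pipeline for Q&A",
        "source_code": yaml_source,
        "overwrite": True,
    },
)
response.raise_for_status()
print(response.json())
```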
## CLI Options
| Option | Short | Description | Default |
|--------|-------|-------------|---------|
| `--name` | `-n` | Override the pipeline name | YAML file stem |
| `--description` | | Human-readable description | Pipeline name |
| `--overwrite` | `-o` | Overwrite if pipeline exists | `false` |
| `--skip-mcp` | | Skip MCP tool registration | `false` |
| `--save-file` | | Save YAML to server | `true` |
| `--no-save-file` | | Don't save YAML to server | `false` |
## Input/Output Mapping
### Input Mapping
The `inputs` section maps friendly names to pipeline component fields:
```yaml
inputs:
  # friendly_name: component.field
  urls: fetcher.urls
  query: prompt_builder.query
```
**Mapping rules:**

- Use `component.field` syntax
- The field must exist on the component
- Input names become API parameters
- A single external field can feed **multiple** component inputs: declare the targets as a YAML list, as shown below
```yaml
inputs:
  query:
    - chat_summary_prompt_builder.query
    - answer_builder.query
```
!!! note "How multi-target inputs are resolved"
Hayhooks normalizes list-declared inputs by looking at the first valid target to derive type metadata and marks the input as **required** regardless of component-level metadata. At runtime the resolved value is fanned out to **all** listed component inputs, so the example above sends the same payload to both `chat_summary_prompt_builder.query` and `answer_builder.query` even if the external parameter is named differently (for example `a_query`). This ensures downstream components get the expected inputs while the API still exposes a single friendly field.
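As a concrete illustration of the fan-out, here is a hedged sketch of calling such a pipeline, assuming the two-target declaration above was deployed under the hypothetical name `my_pipeline`:

```python
import requests

# One external "query" field; Hayhooks fans the value out to both
# chat_summary_prompt_builder.query and answer_builder.query at run time.
response = requests.post(
    "http://localhost:1416/my_pipeline/run",  # hypothetical pipeline name
    json={"query": "Summarize the conversation so far"},
)
print(response.json())
```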
### Output Mapping
The `outputs` section maps pipeline outputs to response fields:
```yaml
outputs:
  # response_field: component.field
  replies: llm.replies
  documents: converter.documents
```
**Mapping rules:**
- Use `component.field` syntax
- Field must exist in the component
- Response fields are serialized to JSON
- Complex objects are automatically serialized
!!! success "Automatic `include_outputs_from` Derivation"

    Hayhooks **automatically** derives the `include_outputs_from` parameter from your `outputs` section. This ensures that all components referenced in the outputs are included in the pipeline results, even if they're not leaf components.

    **Example:** If your outputs reference `retriever.documents` and `llm.replies`, Hayhooks automatically sets `include_outputs_from={"retriever", "llm"}` when running the pipeline.

    **What this means:** You don't need to configure anything extra. Just declare your outputs in the YAML, and Hayhooks ensures those component outputs are available in the results.
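The derivation is simple enough to sketch. The snippet below only illustrates the behavior described above; it is not Hayhooks' actual implementation:

```python
# Illustrative only: derive include_outputs_from from an outputs mapping
# by taking the component part of each "component.field" path.
outputs = {
    "documents": "retriever.documents",
    "replies": "llm.replies",
}
include_outputs_from = {path.split(".")[0] for path in outputs.values()}
print(include_outputs_from)  # {'retriever', 'llm'}
```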
!!! note "Comparison with PipelineWrapper"
**YAML Pipelines** (this page): `include_outputs_from` is **automatic** - derived from your `outputs` section
**PipelineWrapper**: `include_outputs_from` must be **manually passed**:
- For streaming: Pass to `streaming_generator()` / `async_streaming_generator()`
- For non-streaming: Pass to `pipeline.run()` / `pipeline.run_async()`
See [PipelineWrapper: include_outputs_from](pipeline-wrapper.md#accessing-intermediate-outputs-with-include_outputs_from) for examples.
## API Usage
### After Deployment
Once deployed, your pipeline is available at:
- **Run Endpoint**: `/{pipeline_name}/run`
- **Schema**: `/{pipeline_name}/schema`
- **OpenAPI**: `/openapi.json`
### Example Request
```bash
curl -X POST \
  http://localhost:1416/my_chat_pipeline/run \
  -H 'Content-Type: application/json' \
  -d '{
    "urls": ["https://haystack.deepset.ai"],
    "query": "What is Haystack?"
  }'
```
### Example Response
```json
{
  "replies": ["Haystack is an open source framework..."],
  "documents": [...],
  "metadata": {...}
}
```
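The same request from Python, assuming the `my_chat_pipeline` deployment shown above:

```python
import requests

response = requests.post(
    "http://localhost:1416/my_chat_pipeline/run",
    json={
        "urls": ["https://haystack.deepset.ai"],
        "query": "What is Haystack?",
    },
)
response.raise_for_status()
# "replies" matches the friendly name declared in the outputs section.
print(response.json()["replies"][0])
```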
## Schema Generation
Hayhooks automatically generates:
### Request Schema
```json
{
  "type": "object",
  "properties": {
    "urls": {
      "type": "array",
      "items": {"type": "string"}
    },
    "query": {"type": "string"}
  },
  "required": ["urls", "query"]
}
```
### Response Schema
```json
{
  "type": "object",
  "properties": {
    "replies": {"type": "array"},
    "documents": {"type": "array"},
    "metadata": {"type": "object"}
  }
}
```
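If you want to inspect the generated schemas programmatically, you can query the `/{pipeline_name}/schema` endpoint listed earlier. A small sketch (the exact response shape may vary):

```python
import requests

# Fetch the schema generated for the deployed pipeline.
schema = requests.get("http://localhost:1416/my_chat_pipeline/schema").json()
print(schema)
```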
## Obtaining YAML from Existing Pipelines
You can obtain YAML from existing Haystack pipelines:
```python
from haystack import Pipeline

# Create or load your pipeline
pipeline = Pipeline()
# ... add components and connections ...

# Get the YAML representation
yaml_content = pipeline.dumps()

# Save it to a file
with open("pipeline.yml", "w") as f:
    f.write(yaml_content)
```
!!! note

    You'll need to manually add the `inputs` and `outputs` sections to the generated YAML.
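If you prefer to do that step in code, here is a minimal sketch using PyYAML (the component and field names come from the example pipeline at the top of this page):

```python
import yaml

definition = yaml.safe_load(yaml_content)

# Map friendly API fields to component inputs and outputs.
definition["inputs"] = {"urls": "fetcher.urls", "query": "prompt.query"}
definition["outputs"] = {"replies": "llm.replies"}

with open("pipeline.yml", "w") as f:
    yaml.safe_dump(definition, f)
```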
## Limitations
!!! warning "YAML Pipeline Limitations"

    YAML-deployed pipelines have the following limitations:

    - :octicons-x-circle-16: **No OpenAI Compatibility**: Don't support OpenAI-compatible chat endpoints
    - :octicons-x-circle-16: **No Streaming**: Streaming responses are not supported
    - :octicons-x-circle-16: **No File Uploads**: File upload handling is not available
    - :material-lightning-bolt: **Async Only**: Pipelines are run as `AsyncPipeline` instances
!!! tip "Using PipelineWrapper for Advanced Features"
For advanced features, use `PipelineWrapper` instead:
```python
# For OpenAI compatibility
class PipelineWrapper(BasePipelineWrapper):
def run_chat_completion(self, model: str, messages: list[dict], body: dict) -> str | Generator:
...
# For file uploads
class PipelineWrapper(BasePipelineWrapper):
def run_api(self, files: list[UploadFile] | None = None, query: str = "") -> str:
...
# For streaming
class PipelineWrapper(BasePipelineWrapper):
def run_chat_completion_async(self, model: str, messages: list[dict], body: dict) -> AsyncGenerator:
...
```
## Example
For a complete working example of a YAML pipeline with proper `inputs` and `outputs`, see:
- [tests/test_files/yaml/inputs_outputs_pipeline.yml](https://github.com/deepset-ai/hayhooks/blob/main/tests/test_files/yaml/inputs_outputs_pipeline.yml)
This example demonstrates:
- Complete pipeline structure with components and connections
- Proper `inputs` mapping to component fields
- Proper `outputs` mapping from component results
- Real-world usage with `LinkContentFetcher`, `HTMLToDocument`, `ChatPromptBuilder`, and `OpenAIChatGenerator`
## Next Steps
- [PipelineWrapper](pipeline-wrapper.md) - For advanced features like streaming and chat completion
- [Examples](../examples/overview.md) - See working examples