XiYan MCP Server
hybrid server
The server is able to function both locally and remotely, depending on the configuration or use case.
Integrations
Integrates with Alibaba Cloud's Qwen and DashScope APIs to provide text-to-SQL functionality using Qwen-max and xiyansql-qwencoder-32b models
Links to paper resources published on arXiv related to the XiYan-SQL technology
Provides access to model resources and code repositories hosted on GitHub
Table of Contents
Features
- π Fetch data by natural language through XiYanSQL
- π€ Support general LLMs (GPT,qwenmax), Text-to-SQL SOTA model
- π» Support pure local mode (high security!)
- π Support MySQL and PostgreSQL.
- π±οΈ List available tables as resources
- π§ Read table contents
Preview
Architecture
There are two ways to integrate this server in your project, as shown below: The left is remote mode, which is the default mode. It requires an API key to access the xiyanSQL-qwencoder-32B model from service provider (see Configuration). Another mode is local mode, which is more secure. It does not require an API key.
Best practice
Tools Preview
- The tool
get_data
provides a natural language interface for retrieving data from a database. This server will convert the input natural language into SQL using a built-in model and call the database to return the query results. - The
{dialect}://{table_name}
resource allows obtaining a portion of sample data from the database for model reference when a specific table_name is specified. - The
{dialect}://
resource will list the names of the current databases
Installation
Installing from pip
Python 3.11+ is required. you can install the server through pip, and it will install the latest verion
After that you can directly run the server by:
But it does not provide any functions until you complete following config. You will get a yml file. After that you can run the server by:
Installing from Smithery.ai
See @XGenerationLab/xiyan_mcp_server
Not fully tested.
Configuration
You need a yml config file to configure the server. a default config file is provided in config_demo.yml which looks like this:
LLM Configuration
Name
is the name of the model to use, key
is the API key of the model, url
is the API url of the model. We support following models.
versions | general LLMs(GPT,qwenmax) | SOTA model by Modelscope | SOTA model by Dashscope | Local LLMs |
---|---|---|---|---|
description | basic, easy to use | best performance, stable, recommand | best performance, for trial | slow, high-security |
name | the official model name (e.g. gpt-3.5-turbo,qwen-max) | XGenerationLab/XiYanSQL-QwenCoder-32B-2412 | xiyansql-qwencoder-32b | xiyansql-qwencoder-3b |
key | the API key of the service provider (e.g. OpenAI, Alibaba Cloud) | the API key of modelscope | the API key via email | "" |
url | the endpoint of the service provider (e.g."https://api.openai.com/v1") | https://api-inference.modelscope.cn/v1/ | https://xiyan-stream.biz.aliyun.com/service/api/xiyan-sql | http://localhost:5090 |
General LLMs
if you want to use the general LLMs, e.g. gpt3.5, you can directly config like this:
if you want to use Qwen from alibaba, e.g. Qwen-max, you can use following config.
Text-to-SQL SOTA model
We recommend the XiYanSQL-qwencoder-32B (https://github.com/XGenerationLab/XiYanSQL-QwenCoder), which is the SOTA model in text-to-sql, see Bird benchmark. There are two ways to use the model. You can use either of them. (1) Modelscope, (2) Alibaba Cloud DashScope.
(1) Modelscope version
You need to apply a key
of API-inference from Modelscope, https://www.modelscope.cn/docs/model-service/API-Inference/intro
Then you can use the following config:
Read our model description for more details.
(2) Dashscope version
We deployed the model on Alibaba Cloud DashScope, so you need to set the following environment variables:
Send me your email to get the key
. ( godot.lzl@alibaba-inc.com )
In the email, please attach the following information:
We will send you a key
according to your email. And you can fill the key
in the yml file.
The key
will be expired by 1 month or 200 queries or other legal restrictions.
Note: this model service is just for trial, if you need to use it in production, please contact us.
Alternatively, you can also deploy the model XiYanSQL-qwencoder-32B on your own server.
Local Model
Note: local model is slow (about 12 seconds per query on my macbook). If your need stable and fast service, we still recommend to use the modelscope version.
To run xiyan_mcp_server on local mode, you need
- a PC/Mac with at least 16GB RAM
- 6GB disk space
step1: Install additional python packages
step2: (optional) manully download the model We recommand xiyansql-qwencoder-3b. You can manully download the model by
It will take you 6GB disk space.
step4: download the script and run server. src/xiyan_mcp_server/local_xiyan_server.py
The server will be running on http://localhost:5090/
step4: prepare config and run xiyan_mcp_server the config.yml should be like:
Til now the local mode is ready.
Database Configuration
host
, port
, user
, password
, database
are the connection information of the database.
You can use local or any remote databases. Now we support MySQL and PostgreSQL(more dialects soon).
MySQL
PostgreSQL
step1: Install python packages
step2: prepare the config.yml like this:
Note that dialect
should be postgresql
for postgresql.
Launch
Claude desktop
Add this in your claude desktop config file, ref claude desktop config example
Cline
prepare the config like Claude desktop
Goose
Add following command in the config, ref goose config example
Cursor
Use the same command like Goose .
Witsy
Add following in command.
Add an env: key is YML and value is the path to your yml file. Ref witsy config example
It does not work!
contact us: Ding GroupιιηΎ€ο½ Follow me on Weibo
Citation
If you find our work helpful, feel free to give us a cite.
This server cannot be installed
A Model Context Protocol server that enables natural language queries to databases, powered by XiYan-SQL which is a state-of-the-art text-to-SQL model.