scorers.ts
import { generateObject } from 'ai'
import { z } from 'zod'
import { factualityModel } from './test-models'
import type { ScoreFn } from 'vitest-evals'

/**
 * Checks the factuality of a submission, using
 * OpenAI's GPT-4o model.
 */
export const checkFactuality: ScoreFn = async ({ input, expected, output }) => {
  const { model } = factualityModel
  const { object } = await generateObject({
    model,
    /**
     * Prompt taken from autoevals:
     *
     * {@link https://github.com/braintrustdata/autoevals/blob/5aa20a0a9eb8fc9e07e9e5722ebf71c68d082f32/templates/factuality.yaml}
     */
    prompt: `
You are comparing a submitted answer to an expert's rubric on a given question. Here is the data:
[BEGIN DATA]
************
[Question]: ${input}
************
[Expert Rubric]: ${expected}
************
[Submission]: ${output}
************
[END DATA]
Submissions contain message metadata inside of the <message_content> XML tags.
The attribute \`type=text\` indicates text content. The attribute \`type=tool-call\` indicates a tool call.
Use this metadata to determine the accuracy of the response.
Compare the factual content of the submitted answer with the expert's answer rubric. Ignore any differences in style, grammar, or punctuation.
The submitted answer may either be a subset or superset of the expert's expected answer, or it may conflict with it. Determine which case applies. Answer the question by selecting one of the following options:
(A) The submitted answer is a subset of the answer the expert's rubric describes and is fully consistent with it.
(B) The submitted answer is a superset of the answer the expert's rubric describes and is fully consistent with it.
(C) The submitted answer contains all the same details of the answer the expert's rubric describes.
(D) There is a disagreement between the submitted answer and the expert's rubric.
(E) The answers differ, but these differences don't matter from the perspective of factuality.
    `,
    schema: z.object({
      answer: z.enum(['A', 'B', 'C', 'D', 'E']).describe('Your selection.'),
      rationale: z.string().describe('Why you chose this answer. Be very detailed.'),
    }),
  })

  /**
   * LLMs are well documented as being poor at generating consistent
   * numeric scores, so we ask the model to pick a letter grade and
   * map it to a score ourselves.
   */
  const scores = {
    A: 0.4,
    B: 1,
    C: 1,
    D: 0,
    E: 1,
  }

  return {
    score: scores[object.answer],
    metadata: {
      rationale: object.rationale,
    },
  }
}
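
The `factualityModel` used above comes from a separate module that isn't shown here. A minimal sketch of what `./test-models` might export, assuming the AI SDK's OpenAI provider (any model the AI SDK supports would work the same way):

test-models.ts
import { openai } from '@ai-sdk/openai'

/**
 * The judge model used by the factuality scorer.
 * Wrapped in an object so tests can swap the model in one place.
 */
export const factualityModel = { model: openai('gpt-4o') }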
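
From there, the scorer just gets passed to an eval suite. A hedged sketch of that wiring, assuming vitest-evals' `describeEval` helper with `data`/`task`/`scorers`/`threshold` options and a hypothetical `runMyAgent` function standing in for the system under test:

factuality.eval.ts
import { describeEval } from 'vitest-evals'
import { checkFactuality } from './scorers'
import { runMyAgent } from './my-agent' // hypothetical system under test

describeEval('answers geography questions', {
  data: async () => [
    {
      input: 'What is the capital of France?',
      expected: 'Paris is the capital of France.',
    },
  ],
  // The task produces the output that checkFactuality grades against `expected`.
  task: async (input) => runMyAgent(input),
  scorers: [checkFactuality],
  // Fail the eval if the score falls below this value.
  threshold: 0.6,
})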