[ST]

Specialized AI Agent Tools

Domain-specific agent tools designed for particular use cases like customer service, sales, marketing, data analysis, security, and DevOps automation.

154 Entries GitHub Stats Available
Showing 154 of 154 entries

Open-source technical standard developed by a coalition including Scope3, Yahoo, and PubMatic, designed to allow AI agents from advertisers, publishers, and ad tech platforms to communicate and autonomously execute advertising tasks.

Tool

Agentic AI assistant integrated into Adobe Express that allows users to create and edit designs through natural, conversational language, aimed at non-design professionals.

Tool

Enterprise tool integration platform designed to connect AI agents with business applications and workflows. Provides secure, managed connections between agents and enterprise systems with authentication handling, compliance controls, and centralized tool management for production deployments.

Tool

Official direct benchmark harness from the AutoGPT project for evaluating autonomous agent performance across diverse tasks without Agent Protocol server overhead. Provides standardized challenge suites, scoring workflows, and CLI tooling for comparing agent capabilities on planning, reasoning, tool use, and task completion.

Tool

Comprehensive benchmark for evaluating LLM-as-Agent capabilities across 8 distinct environments including coding, gaming, web browsing, and household tasks. Provides standardized evaluation protocols, multi-dimensional metrics, and leaderboards for comparing agent performance across diverse real-world scenarios.

Tool 3418 stars

Monitoring and analytics platform designed specifically for autonomous AI agents. Provides real-time tracking of agent behaviors, decision patterns, and performance metrics with anomaly detection and comprehensive dashboards for production agent systems.

Tool

AI-powered advertising technology platform from LG Ad Solutions, designed to deploy and coordinate internal and external agents for automating operational workflows and data collaboration in Connected TV advertising.

Tool

Tool library and integration framework providing 70+ reusable tools for AI agents including image processing, OCR, search, and data analysis capabilities. Offers standardized tool interfaces compatible with LangChain, Transformers Agents, and other frameworks with easy-to-use APIs and comprehensive documentation.

Tool 412 stars

Open-source Python SDK for AI agent observability providing session replays, metrics, and monitoring for LangChain, CrewAI, and AutoGen. Features detailed agent execution tracking, LLM call logging, cost analysis, and performance metrics for debugging and optimizing multi-agent systems.

Tool 5545 stars

Open-source observability framework for monitoring and debugging AI agent systems. Features execution tracing, state inspection, and visual debugging tools for understanding agent behaviors and optimizing performance in multi-agent environments.

Tool 319 stars

Modular benchmark and development platform for evaluating and building LLM agents. Features customizable evaluation pipelines, standardized metrics, and tooling for systematic agent testing across reasoning, planning, and execution capabilities.

Tool 226 stars

Local-first TUI and CLI for AI coding-agent trace logs. Agenttrace turns local agent traces into cost, token, latency, and health regression reports for debugging coding-agent workflows.

Tool Free MIT
#observability #monitoring #tracing #evaluation #cli #coding-assistant #local-first #terminal

AI-powered sales assistant designed to enhance sales strategies and efficiency, featuring a Memory Module and Online Mode for smarter selling.

Tool

Open-source Base Sepolia testnet settlement rails for humans and AI agents to hire AI agents with EIP-712 signed offers, USDC escrow, proof submission, and programmable release/refund/dispute lifecycle. Includes a CLI, JavaScript SDK, read-only MCP server, and x402 compatibility notes for composing pay-per-call access with escrowed outcome-based work.

Tool Free TypeScript, Solidity ISC
#payments #mcp #sdk #developer-platform #automation

Official Apify MCP server that lets agents run Apify Actors for web scraping, crawling, search, maps, ecommerce, and social-media data extraction through the MCP tool interface.

Tool 1219 stars MIT
#mcp #web-scraping #automation #tools #data-extraction

Open-source data labeling and feedback platform for building LLM datasets, preference data, RLHF workflows, and evaluation sets with collaboration between AI engineers and domain experts.

Tool 4971 stars Apache-2.0
#open-source #annotation #data-labeling #rlhf #evaluation

Open-source AI observability platform for evaluating, troubleshooting, and monitoring LLM applications and agents. Provides experiment tracking, prompt tracing, retrieval analysis, and LLM evaluations with support for traces, spans, and comprehensive debugging tools for production AI systems.

Tool

AI-powered B2B sales automation platform featuring Ava, an AI Sales Development Representative that automates over 80% of outbound tasks including lead discovery, email personalization, and meeting scheduling.

Tool

Official Asana MCP server for giving AI assistants access to the Asana Work Graph. It supports authenticated interactions with tasks, projects, portfolios, teams, and project-management workflows.

Tool Freemium
#mcp #project-management #productivity #official #tools

LLM monitoring and evaluation platform for production AI applications. Athina provides real-time monitoring, batch evaluations, hallucination checks, PII checks, dataset evaluation, and RAG pipeline quality workflows.

Platform Freemium
#observability #evaluation #monitoring #hallucination-detection #rag

Python SDK for Atla Insights, a platform for monitoring and improving AI agents. It helps developers instrument agent workflows, capture evaluation data, and feed results into Atla's agent monitoring stack.

Tool Freemium Python 7 stars Apache-2.0
#evaluation #testing #llm #safety #enterprise

Atlassian's Rovo MCP server for connecting agents to Jira, Confluence, Compass, and Atlassian work data. It supports search, summarization, issue creation, page updates, and remote MCP access with enterprise controls.

Tool 689 stars Apache-2.0
#mcp #project-management #enterprise #official #developer-platform

Automated evaluation and governance platform for generative AI systems. Aymara generates policy-grounded safety, accuracy, fairness, and compliance evaluations, scores model or application responses, and helps teams monitor and improve deployed AI behavior.

Platform Freemium
#evaluation #safety #governance #benchmark #multimodal

Open-source Python library for synthetic data curation, post-training data generation, and structured data extraction. Bespoke Curator helps teams build scalable LLM-powered data pipelines with async execution, caching, fault recovery, interactive viewing, and dataset curation recipes.

Tool 1675 stars Apache-2.0
#synthetic-data #data-generation #fine-tuning #python #open-source

Enterprise voice AI platform for automating inbound and outbound phone calls. Bland AI provides phone agents, call APIs, webhooks, workflow integrations, simulations, regression testing, analytics, and custom voice experiences for large-scale customer communication.

Platform Paid
#voice-agents #contact-center #enterprise #customer-service #sales

Enterprise-grade AI product stack providing evaluations, prompt playground, logging, and dataset management for AI agents. Offers end-to-end workflow for building reliable AI products with continuous evaluation, prompt optimization, and production monitoring capabilities.

Tool

AI-powered phone call automation for scheduling built into Cal.com, featuring customizable human-like conversations that reduce no-shows and boost conversions. Allows users to assign dedicated phone numbers, write custom script prompts, define agent personality and tone, trigger calls on form submission or before meetings, and automate booking workflows at $0.29 per minute.

Tool

Real-time speech generation API optimized for conversational voice agents. Cartesia Sonic provides low-latency text-to-speech, expressive voices, voice cloning, multilingual output, and streaming APIs used as the voice layer in interactive agent pipelines.

Platform Freemium
#voice-agents #real-time #api #developer-platform #hosted-service

End-to-end quality assurance platform for conversational AI agents providing automated testing, observability, and monitoring for voice and chat bots. Covers full agent lifecycle from pre-production simulation to post-deployment analytics with real-time failure alerts and regression tracking.

Platform Enterprise Web
#testing #observability #qa #monitoring #voice-agents #chat-agents #analytics

Open-source search infrastructure and embedding database for AI applications. Chroma supports vector, full-text, metadata, and hybrid search locally or through Chroma Cloud, making it common infrastructure for RAG prototypes and production applications.

Tool Freemium 27956 stars Apache-2.0
#vector-stores #database #rag #search #open-source

Open-source document intelligence API for layout analysis, OCR, and semantic chunking. Chunkr converts PDFs, presentations, Word documents, and images into structured HTML, markdown, or JSON chunks for RAG and LLM pipelines.

Tool Freemium 2941 stars AGPL-3.0
#rag #document-ai #data-extraction #open-source #hosted-service

Integrated edge computing platform announced November 2025 for distributed agentic AI workloads, combining compute, networking, and storage into a single modular system. Features CPU and GPU configurations, up to 120TB storage, redundant power and cooling, integrated 25-gigabit networking, zero-touch deployment, and pre-validated blueprints designed for real-time AI inferencing from retail stores to healthcare facilities to factory floors.

Tool

Specialized version of Anthropic Claude model aimed at supporting the entire scientific process, featuring new connectors to scientific platforms like Benchling to assist with research and discovery.

Tool

API gateway for AI agents that exposes developer services such as web scraping, screenshots, DNS lookup, geolocation, crypto prices, code execution, storage, scheduling, and x402/USDC payments through a unified OpenAPI-described service surface.

Platform Freemium REST API, OpenAPI, MCP Proprietary

AI-powered sales platform that helps teams enrich, score, and automatically message leads. Features AI research agents that browse the web and gather information to create personalized outreach at scale.

Tool

Official ClickUp MCP server for connecting AI assistants to ClickUp workspace data such as tasks, lists, folders, docs, and project workflows through authenticated MCP access.

Tool Freemium
#mcp #project-management #productivity #official #tools

Cloudflare's AI gateway proxy for monitoring, controlling, and optimizing traffic between applications and model providers. It provides request logs, analytics, caching, rate limiting, retries, model fallback, and edge deployment options for production AI applications.

Platform Freemium
#gateway #cloudflare #edge #observability #developer-platform

Official Cloudflare MCP server exposing Cloudflare services as tools for agents and AI assistants. It enables natural-language workflows for Workers, storage, domains, and other Cloudflare platform resources.

Tool 3740 stars Apache-2.0
#mcp #cloudflare #cloud-native #official #developer-platform

AI-powered code analysis service that automatically flags security vulnerabilities, provides performance recommendations, and explains code issues with machine learning. Integrates with popular development tools and CI/CD pipelines.

Tool

Open-source memory and knowledge infrastructure for AI agents. Cognee turns documents and conversations into graph- and vector-backed memory that agents can retrieve from and reason over.

Tool Python 17223 stars Apache-2.0
#open-source #memory #knowledge-graph #rag #python

Conversational AI platform focused on enterprise customer service automation. Enables creation of intelligent virtual agents with advanced natural language processing and seamless integration with existing business systems.

Tool

Platform and SDK for connecting AI agents to external tools, authenticated apps, and sandboxed workbench environments. Composio provides toolkits, tool search, context management, auth handling, and integrations for production agents that need to act across third-party services.

Tool Freemium 28235 stars MIT
#integrations #authentication #tool-calling #developer-platform #mcp

Enterprise security platform designed to secure autonomous AI agents themselves, addressing the new security risks and access-management challenges created when agents are deployed in sensitive environments.

Tool

Application performance monitoring platform with AI-powered agents for distributed tracing, root cause analysis, and proactive application improvement. Features Bits AI for data querying and automated remediation suggestions.

Tool

Open-source LLM evaluation framework with a pytest-like interface. DeepEval provides metrics for RAG, hallucination, answer relevance, bias, and custom criteria, with Confident AI offering a managed evaluation platform.

Tool Python 15410 stars Apache-2.0
#open-source #evaluation #testing #python #rag

Unified voice agent API that combines Deepgram speech-to-text, text-to-speech, and LLM orchestration for real-time conversational AI. It supports streaming audio, interruption handling, function calls, and developer controls for building responsive voice agents.

Platform Freemium
#voice-agents #real-time #api #multimodal #developer-platform

Open-source framework from Argilla for building synthetic data and AI feedback pipelines. Distilabel generates and labels datasets with LLMs, supports preference and evaluation data workflows, and provides scalable pipeline primitives for fine-tuning and alignment datasets.

Tool 3217 stars Apache-2.0
#synthetic-data #data-generation #rlhf #fine-tuning #open-source

ElevenLabs platform for building low-latency voice and chat agents with human-like speech, configurable behavior, knowledge sources, web and phone deployment, SDKs, and monitoring. It combines ElevenLabs speech models with agent orchestration for customer-facing voice experiences.

Platform Freemium
#voice-agents #real-time #api #hosted-service #developer-platform

Open-source decision tree-based agentic RAG framework by Weaviate that dynamically displays data, learns from user feedback, and chunks documents on-demand. Features intelligent tool selection with transparent decision-making, context-aware on-the-fly document chunking, feedback-driven learning without cross-user contamination, and both full frontend interface and pip-installable Python package.

Tool

AI meeting assistant and meeting agent for recording, transcribing, summarizing, searching, and acting on meeting content. Fellow can generate follow-ups, update CRM fields, and integrate meeting insights with project tools.

Tool Freemium
#meeting-agent #productivity #enterprise #integrations #knowledge-management

Official Figma MCP server that exposes design context to AI coding agents and allows agents to read design information or write native Figma content back to the canvas.

Tool 1417 stars
#mcp #figma #design-to-code #official #developer-platform

Non-profit research lab building AI agents to automate and scale scientific research, with a primary focus on accelerating discovery in biology and other complex sciences.

Tool

Enterprise LLM evaluation and observability platform for testing, monitoring, and improving AI applications. Galileo focuses on RAG quality, data quality, hallucination detection, and production evaluation workflows.

Platform Enterprise
#observability #evaluation #hallucination-detection #rag #enterprise

Multi-agent framework designed to automate scientific workflows, such as gene expression analysis.

Tool 138 stars

Benchmark for evaluating LLM agents in the domain of gene expression data analysis.

Tool 64 stars

GitHub's official MCP server for repositories, issues, pull requests, code search, Actions, and related GitHub API operations. It enables MCP-compatible agents to inspect and operate on GitHub resources.

Tool 29832 stars MIT
#mcp #github #developer-platform #official #open-source

Official GitLab MCP server for exposing GitLab resources such as projects, repositories, issues, merge requests, and CI/CD information to compatible AI agents and editor assistants.

Tool Free
#mcp #gitlab #developer-platform #devops #official

MCP gateway, model access hub, and server directory for AI applications. Glama provides discovery, hosted MCP tooling, model access, analytics, and cost controls for agent builders.

Platform Freemium
#mcp #registry #gateway #directory #llm

AI-powered observability agent within Grafana Cloud that assists with investigations, incident response, and system monitoring. Uses LLMs to analyze metrics, logs, and traces, providing intelligent insights and automated root cause analysis for complex distributed systems.

Tool

Open-source framework from Zep for building real-time temporal knowledge graphs for AI agents. Graphiti extracts entities, relationships, facts, and time-aware memory from conversations and external data.

Tool Python 26065 stars Apache-2.0
#open-source #memory #knowledge-graph #rag #python

Leading AI platform for legal work, providing generative AI agents trained on legal data to assist law firms and in-house teams with tasks like document review, contract analysis, due diligence, and legal research.

Tool

Open-source observability platform for AI agents offering one-line integration for logging, monitoring, and debugging LLM applications. Features request logging, cost tracking, latency monitoring, caching, rate limiting, and prompt versioning with support for all major LLM providers.

Tool

GTM Intelligence platform with AI agents (Odin and Nova) that analyze buyer journeys, connect to GTM tech stack, provide account and lead scoring, touchpoint analysis, and actionable recommendations without coding.

Tool

AI evaluation and observability platform for production LLM apps. HoneyHive records traces, manages evaluation datasets, supports human annotation, and runs regression tests for prompt and agent changes.

Platform Freemium
#observability #evaluation #dataset-management #annotation #tracing

Hume's Empathic Voice Interface for building voice AI that can understand and respond to vocal emotion in real time. EVI combines speech recognition, emotion understanding, language modeling, and voice output for emotionally responsive conversational agents.

Platform Freemium
#voice-agents #real-time #api #multimodal #developer-platform

IBM Research project for observability and debugging of agentic AI systems. Provides tools for tracing agent reasoning, visualizing decision trees, and analyzing multi-agent interactions for research and production deployments.

Tool

AI agent built with Google Gemini models embedded in the Xvantage distribution platform, designed to provide actionable daily briefs and data-driven recommendations to sales teams.

Tool

Open-source framework for large language model evaluations from the UK AI Safety Institute. Inspect AI supports multi-turn tasks, agent evaluations, sandboxed code execution, scorers, datasets, and reproducible eval runs.

Tool 2058 stars MIT
#open-source #evaluation #testing #safety #sandbox

AI-powered email outreach platform that automates sales prospecting with unlimited email account connections, AI personalization, and campaign analytics. Focuses on scaling cold email outreach with deliverability optimization.

Tool

Enterprise AI marketing platform that creates brand-aligned content across channels. Features specialized marketing agents for content creation, campaign development, and brand voice consistency.

Tool

JetBrains MCP server plugin for exposing IDE context, project structure, files, and development actions from JetBrains IDEs to MCP-compatible AI assistants and coding agents.

Tool Free
#mcp #jetbrains #coding-assistant #developer-platform #tools

Intelligent data analyst tool that interprets, analyzes, and visualizes complex data with strong encryption and security. Provides user-friendly data processing with automated insights generation and secure data handling.

Tool

AI gateway capability in Kong's API platform for routing AI requests through a provider-agnostic API. Kong AI Gateway centralizes credentials, request routing, prompt and response controls, semantic caching, token-aware policies, and enterprise governance for AI API traffic.

Platform Enterprise
#gateway #enterprise #api #model-routing #governance

AI-powered API testing platform that generates and runs comprehensive test suites automatically. Integrates with CI/CD pipelines to provide continuous testing and crash-free releases with AI-analyzed results.

Tool

Open-source embedded retrieval library and vector database for multimodal AI applications. LanceDB is built on the Lance columnar format and supports vector search, full-text search, hybrid search, and local or cloud retrieval workflows.

Tool Freemium 10301 stars Apache-2.0
#vector-stores #database #rag #multimodal #open-source

Serverless AI developer platform for building and deploying AI agents, apps, and features. Langbase provides composable AI primitives, memory, tools, model routing, and infrastructure for production LLM applications.

Platform Freemium
#serverless #developer-platform #memory #tools #orchestration

Open-source LLM observability and analytics platform providing tracing, prompt management, evaluation, and analytics for AI agents. Features detailed execution traces, cost tracking, quality metrics, and collaborative prompt versioning for debugging and optimizing agentic systems in production.

Tool

LangChain's observability, tracing, and evaluation platform for LLM applications and agents. LangSmith records chain and tool traces, manages datasets, runs evaluations, and supports debugging and regression testing.

Platform Freemium
#observability #evaluation #tracing #dataset-management #langchain

Open-source observability platform for LLM applications built on OpenTelemetry standards. Provides distributed tracing, metrics collection, prompt management, and evaluation tools with support for all major agent frameworks and LLM providers.

Tool 1203 stars

Open-source agent engineering, prompt management, and evaluation platform. Latitude version-controls prompts, runs automated evaluations, tracks regressions, and supports collaboration around LLM and agent workflows.

Platform 3998 stars LGPL-3.0
#open-source #evaluation #prompt-management #sdk #developer-platform

Linear's MCP integration for connecting Claude and other compatible agents to Linear issues, projects, comments, and project-management workflows through secure authenticated access.

Tool Freemium
#mcp #project-management #productivity #official #tools

Open-source Python SDK and proxy server that exposes a unified OpenAI-compatible API for 100+ LLM providers. LiteLLM Proxy acts as an AI gateway with logging, cost tracking, retries, rate limits, load balancing, guardrails, and provider failover.

Tool Freemium 47000 stars MIT
#gateway #api #model-routing #llm #open-source

LlamaIndex's managed document parsing service for turning complex documents into AI-ready data. LlamaParse handles PDFs, tables, charts, handwriting, checkboxes, images, and many file formats, returning clean markdown, text, or JSON for RAG and agent pipelines.

Platform Freemium
#rag #document-ai #data-extraction #hosted-service #llm

LLM observability and prompt management platform for tracking prompts, traces, costs, user feedback, evaluations, and analytics across AI products. Lunary provides SDKs and hosted monitoring for production LLM applications.

Platform Freemium
#observability #monitoring #analytics #prompt-management #tools

AI-native search and discovery platform for ecommerce teams. Marqo uses semantic search, personalization, clickstream, purchase, and event data to improve product search relevance, recommendations, conversion, and merchandising workflows.

Platform Paid
#search #ecommerce #data-platform #hosted-service #multimodal

AI observability and evaluation platform for testing prompts, running simulations, managing datasets, monitoring production behavior, and catching regressions in LLM and agent applications.

Platform Enterprise
#observability #evaluation #testing #dataset-management #monitoring

Official visual testing and debugging tool for MCP servers. MCP Inspector provides a web UI for connecting to a server, browsing tools and resources, and manually executing calls while developing MCP integrations.

Tool 9759 stars
#mcp #debugging #developer-platform #official #open-source

Official repository of Model Context Protocol reference server implementations. It includes examples for common integrations such as filesystem, databases, search, messaging, and browser-adjacent tool access.

Tool 85653 stars
#mcp #open-source #reference #tools #official

Open-source framework for connecting MCP servers to LLM applications and agent clients beyond Claude. mcp-use helps developers build MCP apps and integrate tools with OpenAI, Anthropic, local models, and agent frameworks.

Tool Python 9954 stars MIT
#mcp #open-source #sdk #python #tool-calling

Searchable community directory of MCP servers and clients. mcp.so lists available servers with categories, links, and metadata for discovering MCP tools across the ecosystem.

Platform Free
#mcp #directory #registry #community #tools

Universal self-improving memory layer for AI agents and LLM applications, enabling personalized AI interactions with just three lines of code. Features long-term, short-term, semantic, and episodic memory types, integrates with OpenAI, LangGraph, CrewAI, and selected as exclusive memory provider for AWS Agent SDK. Achieves 26% improvement in LLM-as-a-Judge metrics with 91% lower p95 latency and 90% token cost savings.

Tool

Open-source, cloud-native vector database for scalable approximate nearest-neighbor search over high-dimensional data. Milvus supports large-scale vector indexing, distributed deployments, multimodal search, and managed Zilliz Cloud deployments for RAG and AI search workloads.

Tool Freemium 44295 stars Apache-2.0
#vector-stores #database #rag #distributed #open-source

Open-source analytics and evaluation platform for voice AI agents, functioning as Mixpanel for conversational AI with auto-generation of interactive call flow visualizations. Enables developers to analyze, visualize, evaluate, and optimize conversational AI performance by understanding common user paths, behaviors, and agent interaction patterns for continuous improvement.

Tool

Serverless cloud platform for running AI workloads, agents, sandboxes, batch jobs, and model inference from Python. Modal is commonly used to host code execution, tool execution, and GPU-backed agent infrastructure.

Platform Freemium
#serverless #cloud-native #sandbox #python #developer-platform

Open protocol for connecting AI applications and agents to external tools, data sources, and prompts. MCP defines a client-server architecture that lets models discover and call capabilities exposed by compatible servers.

Tool
#mcp #protocol #interoperability #tools #open-source

Official SDK collection for implementing MCP clients and servers across languages including Python, TypeScript, Kotlin, Java, C#, Go, Ruby, and Rust. These SDKs provide the base libraries for protocol-compliant MCP integrations.

Tool Free
#mcp #sdk #official #developer-platform #tools

Zero-code open-source platform for auto-generating intelligent agents from natural language prompts through a simple workflow: prompt -> plan -> execute. Eliminates complex orchestration and drag-and-drop requirements while offering powerful agent running control, data processing capabilities, and MCP tool integration for building sophisticated agents without technical expertise.

Tool 4472 stars

Agent-first search engine that indexes 8,000+ MCP servers and other agent-readable services ranked across 7 agentic readiness signals (llms.txt, OpenAPI, ai-plugin, MCP, structured API, robots.txt, schema.org). Useful as an agent-discovery primitive — one agent can query NHS to find another agent to delegate work to. Includes verify_mcp live JSON-RPC probe. Queryable via MCP, REST API, or browser. Listed in the official MCP registry as `ai.nothumansearch/search`.

Tool 1 stars

Official Notion MCP server that gives compatible agents access to Notion pages, databases, blocks, and workspace content for knowledge retrieval and write-back workflows.

Tool 4329 stars MIT
#mcp #productivity #knowledge-management #official #tools

NVIDIA family of open models, training data, and recipes for building specialized AI agents and generating training data. Nemotron includes open weights and model families used for agentic reasoning, synthetic data generation, reward modeling, and fine-tuning workflows.

Tool Free
#nvidia #foundation-models #synthetic-data #rlhf #fine-tuning

Open-source benchmark framework for evaluating web operators and agents on their ability to complete web tasks. Provides transparent, reproducible performance evaluations with WebVoyager30 benchmark dataset covering 30 diverse web tasks.

Tool 49 stars

Autonomous AI security agent powered by GPT-5 that operates as an agentic security researcher to continuously monitor repositories, discover vulnerabilities, assess exploitability, and propose targeted patches.

Tool

OpenAI API surface for low-latency, realtime model interactions over live audio and other streaming inputs. The Realtime API is commonly used to build speech-to-speech voice agents in browsers or servers with WebRTC, WebSocket, tool calling, and multimodal interaction patterns.

Platform Paid
#voice-agents #openai #real-time #multimodal #api

Autonomous research agent specifically tailored for the analysis of health and medical data.

Tool 262 stars

Unified API and model marketplace for accessing hundreds of AI models through an OpenAI-compatible endpoint. OpenRouter supports model discovery, provider routing, fallbacks, price comparison, and pay-per-token access across major model providers.

Platform Paid
#gateway #api #model-routing #foundation-models #developer-platform

Open-source LLM evaluation and observability platform from Comet. Opik traces agentic workflows, RAG systems, and LLM applications, then supports automated evaluation, dashboards, and production monitoring.

Tool 19291 stars Apache-2.0
#open-source #observability #evaluation #tracing #rag

LLMOps platform for prompt management, model routing, experimentation, observability, and deployment workflows. Orq.ai helps teams manage prompt changes, monitor usage, and route requests across models from one platform.

Platform Enterprise
#llm-ops #prompt-management #observability #gateway #model-routing

AI meeting assistant and conversational knowledge engine for meetings. Otter joins meetings, records and transcribes conversations, summarizes action items, answers questions over meeting history, and supports meeting-agent workflows.

Tool Freemium
#meeting-agent #productivity #voice-agents #knowledge-management #enterprise

Automated evaluation, testing, and red-teaming platform for LLM and agent applications. Patronus provides evaluators for hallucination, safety, policy compliance, off-topic behavior, PII, and custom production quality criteria.

Platform Enterprise
#evaluation #safety #testing #hallucination-detection #enterprise

Fully managed vector database for production AI applications. Pinecone provides serverless vector search, metadata filtering, namespaces, automatic indexing, and managed scaling for RAG, agent memory, semantic search, and recommendation workloads.

Platform Freemium
#vector-stores #database #rag #serverless #hosted-service

Managed MCP server from Pipedream that exposes app integrations and workflow actions to AI agents through a hosted MCP endpoint, avoiding local server setup for common SaaS integrations.

Tool Freemium
#mcp #automation #integrations #developer-platform #tools

Enterprise conversational voice AI platform for contact centers. PolyAI builds customer-led voice agents for phone support, reservations, account servicing, payments, and other high-volume customer-service workflows across regulated and global businesses.

Platform Enterprise
#voice-agents #contact-center #enterprise #customer-service #hosted-service

Open-source AI gateway and production platform for routing requests across LLM providers. Portkey adds retries, fallbacks, load balancing, caching, observability, guardrails, prompt management, and model catalog support for production LLM and agent applications.

Platform Freemium 11720 stars MIT
#gateway #model-routing #observability #llm-ops #mcp

Prompt management and LLM observability platform for logging requests, versioning prompts, running prompt experiments, tracking metadata, and monitoring production performance over time.

Platform Freemium
#observability #prompt-management #tracing #evaluation #tools

MCP ecosystem tracking site and newsletter covering MCP servers, clients, releases, and use cases. PulseMCP helps developers follow new tools and ecosystem changes around Model Context Protocol.

Community Free
#mcp #newsletter #community #resources #directory

AI model access and routing platform for configuring model selection across Pulze spaces. Pulze supports model and router configuration, custom routing policies, provider access, cost controls, and reliability settings for AI applications.

Platform Paid
#gateway #model-routing #optimization #llm #hosted-service

Observability platform from Pydantic designed specifically for Python applications and AI agents. Provides structured logging, tracing, and monitoring with type-safe instrumentation, seamless integration with Pydantic models, and powerful debugging capabilities for production systems.

Tool

Open-source vector similarity search engine and vector database written in Rust. Qdrant provides payload filtering, vector search APIs, production indexing, cloud hosting, and managed on-prem options for retrieval and RAG applications.

Tool Freemium 31320 stars Apache-2.0
#vector-stores #database #rag #rust #open-source

Open-source evaluation framework for RAG and LLM applications. RAGAs includes metrics for faithfulness, answer relevance, context precision, context recall, and LLM-as-judge style evaluation.

Tool Python 13916 stars Apache-2.0
#open-source #evaluation #rag #python #testing

Modular framework for building Retrieval-Augmented Generation pipelines with support for Agentic RAG featuring multi-step reasoning and tool usage. Includes seamless MCP Server integration for external tool interaction, customizable LLM providers (OpenAI, Ollama), vector store integration, and support for multiple knowledge sources including local folders and GitHub repositories.

Tool 660 stars

Document parsing and extraction API for converting complex PDFs, spreadsheets, presentations, and scanned documents into structured output for RAG and LLM workflows. Reducto focuses on layout-aware parsing, tables, figures, OCR, splitting, and extraction.

Platform Paid
#rag #document-ai #data-extraction #api #hosted-service

Low-code platform for building AI sales agents and teams that automate lead generation, research, and follow-up processes. Features specialized sales prospecting agents with CRM integration and customizable workflow automation.

Tool

Voice AI platform for building low-latency phone agents for sales, support, scheduling, and service workflows. Retell provides turn-taking, interruption handling, telephony integrations, testing, analytics, and APIs for production inbound and outbound call automation.

Platform Paid
#voice-agents #real-time #customer-service #sales #api

Autonomous AI agent focused on sales automation, designed to handle sales tasks and customer interactions to close deals.

Tool

MCP server for connecting AI agents to Sentry projects, issues, stack traces, releases, and performance data. Sentry MCP lets agents inspect production errors and assist with debugging workflows.

Tool 688 stars
#mcp #sentry #observability #monitoring #debugging

Open-source Conversational Speech Model from Sesame for generating natural conversational speech from text and audio inputs. The model underpins Sesame's voice companion demos and provides a research-grade speech generation component for voice agent experiments.

Tool 14619 stars Apache-2.0
#voice-agents #open-source #multimodal #real-time #research

Specialized platform for building customer service AI agents that deliver empathetic, personalized conversations. Enables agents to take action through CRM integration and order management systems while maintaining brand tone and voice consistency.

Tool

Registry and discovery platform for MCP servers. Smithery catalogs MCP-compatible tools with metadata, installation instructions, and search by integration or capability.

Platform Freemium
#mcp #registry #directory #tools #community

Enterprise AI data development platform for curating training data, evaluating models, optimizing RAG pipelines, and fine-tuning LLMs. Snorkel Flow supports programmatic labeling, SME collaboration, annotation, data quality workflows, and synthetic-data-oriented development.

Platform Enterprise
#data-labeling #synthetic-data #fine-tuning #evaluation #enterprise

Developer security platform that uses DeepCode AI to provide real-time security intelligence, automatically fix vulnerabilities, and integrate security into development workflows. Features AI-powered code analysis and automated remediation capabilities.

Tool

Stripe's official toolkit for building AI-powered products and connecting agents to Stripe payments, customers, invoices, subscriptions, refunds, and related financial workflows, including MCP-compatible tooling.

Tool 1549 stars MIT
#mcp #stripe #payments #official #sdk

Healthcare AI employee platform for hospitals and medical practices. Sully automates clinical, administrative, and patient-operations workflows with agents for scribing, reception, nursing, review replies, review insights, and integrations across EHRs, payments, forms, communications, and analytics systems.

Platform Enterprise
#healthcare #workflow #automation #integrations #contact-center #enterprise

No-code enterprise platform for creating and deploying natural-sounding voice agents with multilingual support for 30+ languages and dialects. Features sub-100ms latency with in-house telephony, HIPAA and GDPR compliance, 200+ enterprise integrations including Salesforce and HubSpot, and white-label capabilities for handling customer support, appointment scheduling, and complex workflows at scale.

Tool

Cloud security platform that uses AI agents for threat detection, vulnerability management, and cloud detection response. Features real-time security intelligence and automated response capabilities for container and Kubernetes environments.

Tool

Enterprise-grade AI speech-to-text platform offering industry-leading transcription accuracy with Word Error Rate under 4%, featuring emotion detection across 7 emotions and purchase intent analysis. Provides secure deployment options across on-premises, public, private, or hybrid cloud with advanced capabilities including dialogue summarization, topic extraction, and PII redaction for customer interaction insights.

Tool

Agentless Contact Center platform achieving 100% automation of level 1 support with 99% accuracy. Features advanced natural language understanding, multi-channel support, and LLM orchestration for enterprise-grade customer service automation.

Tool

Provides advanced AI agents for data analysis including Discover agents for research, Chain of Thought agents for complex problem-solving, and Analyst agents for real-time financial analysis. Features comprehensive workflow automation from data gathering to insight generation.

Tool

Comprehensive benchmark for evaluating LLM agents on tool usage and API interaction capabilities. Features 16,000+ real-world APIs, standardized evaluation metrics, and test scenarios covering tool selection, parameter filling, and multi-step tool orchestration.

Tool 5636 stars

Cloud tool-calling infrastructure for AI agents and LLM apps. Toolhouse provides hosted tools for search, memory, email, browser actions, and code execution through SDKs and integrations with agent frameworks.

Tool Freemium
#tool-calling #sdk #integrations #cloud-native #mcp

Open-source platform for search, recommendations, RAG, and analytics delivered through APIs. Trieve combines vector search, keyword search, ranking, chunk management, and hosted infrastructure for teams adding retrieval to AI products.

Platform Freemium 2653 stars MIT
#rag #search #vector-stores #open-source #hosted-service

Open-source platform for building and running long-running workflows, background jobs, and AI agents in TypeScript and Python. Trigger.dev provides durable execution, retries, queues, observability, and hosted deployment for agentic workloads.

Tool Freemium TypeScript, Python 14923 stars Apache-2.0
#open-source #workflow #orchestration #developer-platform #typescript

Enterprise AI gateway from TrueFoundry that provides a proxy layer between applications, LLM providers, MCP servers, and agents. It offers unified access, observability, governance, access control, routing policies, budget controls, and deployment integration for organization-wide AI usage.

Platform Enterprise
#gateway #model-routing #deployment #enterprise #governance

Serverless vector and full-text search engine built on object storage. turbopuffer supports hybrid search, metadata filtering, automatic scaling, and low-latency retrieval over billions of vectors for RAG, semantic search, and AI application workloads.

Platform Paid
#vector-stores #search #serverless #cloud-native #rag

Open-source AI framework for semantic search, RAG, LLM orchestration, and language-model workflows. txtai combines vector search, sparse retrieval, graph networks, relational storage, pipelines, and workflow orchestration in a Python library.

Tool 12530 stars Apache-2.0
#rag #search #python #workflow #open-source

Open-source document ETL toolkit and enterprise data platform for transforming complex files into clean, structured inputs for language models. Unstructured supports parsing, chunking, enrichment, embedding, and connectors for production RAG pipelines.

Tool Freemium 14709 stars Apache-2.0
#rag #document-ai #data-extraction #data-platform #open-source

Developer-focused voice AI platform for building advanced voice agents with enterprise infrastructure, featuring response times under 500ms and support for 100+ languages. VAPI provides Flow Studio for visual conversational logic design, highly customizable STT/LLM/TTS provider selection, and scalable phone operations for inbound and outbound calls across industries like healthcare, finance, and travel.

Tool

Enterprise AI agent and RAG platform for grounded search, retrieval, governed agent workflows, and factual-consistency enforcement. Vectara provides managed retrieval, citations, hallucination evaluation, policy controls, and deployment options for trusted AI applications.

Platform Enterprise
#rag #search #enterprise #hallucination-detection #governance

Open-source library for building voice-based LLM agents and real-time streaming conversations. Vocode provides abstractions and integrations for speech recognition, language models, text-to-speech, telephony, phone calls, meetings, and voice assistants.

Tool Freemium 3744 stars MIT
#voice-agents #open-source #python #real-time #tools

All-in-one platform for building, testing, and deploying AI voice agents with access to latest super-realistic voice models including Sesame CSM-1B, Dia, and Orpheus. Features optimized compute for real-time inference with sub-200ms time-to-first-token, supports both zero-shot voice cloning and fine-tuning, and provides unified API for multiple voice model integration.

Tool

Open-source, cloud-native vector database for storing objects and vectors together. Weaviate supports vector search, hybrid keyword and vector retrieval, structured filtering, RAG, reranking, and managed Weaviate Cloud deployments.

Tool Freemium 16180 stars BSD-3-Clause
#vector-stores #database #rag #graphql #open-source

W&B's LLM observability and evaluation toolkit for tracing AI application calls, capturing inputs and outputs, managing evaluation datasets, and comparing model or prompt behavior inside the broader Weights & Biases platform.

Platform Freemium
#observability #evaluation #tracing #monitoring #llm

Enterprise generative AI platform designed for content creation, editing, and optimization. Provides AI agents for marketing teams to maintain brand consistency and accelerate content production workflows.

Tool

Zapier's MCP endpoint for giving AI agents access to Zapier's large library of app actions and automations. It lets agents use business apps through a managed, authenticated MCP tool surface.

Tool Freemium
#mcp #automation #integrations #no-code #tools

Memory layer for AI assistants and agents that combines long-term memory, knowledge graph extraction, vector search, and temporal reasoning. Zep provides hosted infrastructure for persistent, contextual agent memory.

Platform Freemium
#memory #knowledge-graph #vector-stores #rag #tools