
Research-Focused Agent Frameworks

Academic and research-oriented frameworks from institutions such as Google Research, Stanford, and UC Berkeley that advance the state of the art in agent systems.

16 Entries GitHub Stats Available

Autonomous research workflow that helps generate ideas, conduct literature reviews, write and run experiment code, create reports, and publish or retrieve work through the AgentRxiv collaborative research mechanism.

Research Python 5589 stars MIT
#research #autonomous-agent #science #open-source #workflow

A recommender-system simulator that uses 1,000 LLM-powered generative agents. Agent4Rec provides a research environment for studying agent-based recommendation algorithms and user behavior modeling.

Framework 483 stars

A multi-agent AI system built on Gemini 2.0 and designed to act as a virtual scientific collaborator. AI Co-Scientist uses specialized agents (Generation, Reflection, Ranking, Evolution, Proximity, and Meta-review) to generate novel research hypotheses, conduct literature reviews, and formulate research proposals with automated feedback loops.

Framework

Sakana AI research system for automatically generating, testing, and optimizing CUDA kernels. The agentic workflow translates PyTorch operations into CUDA code and iteratively improves performance through evaluation.

Research
#research #autonomous-agent #cuda #coding-assistant #optimization

Sakana AI's autonomous research agent that generates research ideas, writes experiment code, runs studies, analyzes results, and produces paper-style reports with minimal human intervention.

Research Python 13602 stars
#research #autonomous-agent #science #open-source #automation

A research framework in which agents role-play to solve tasks collaboratively through conversation. CAMEL lets agents take different roles (user, assistant) and drive problem-solving through dialogue, with implications for training, simulation, and AI alignment research.

Framework 16955 stars

Self-improving coding-agent research system from Sakana AI and collaborators. The Darwin Gödel Machine rewrites its own code, evaluates variants on programming benchmarks, and archives successful improvements for open-ended exploration.

Research 2043 stars Apache-2.0
#research #self-improving #coding-assistant #open-source #benchmark

A research initiative that uses agent-based APIs to build self-organizing, ethically governed ecosystems of AI agents. HAAS features hierarchical control mechanisms with specialized roles, including a Supreme Oversight Board and Executive Agents, for autonomous system governance.

Framework 3097 stars

Foundation action model for generalist GUI agents. OS-Atlas is trained for screen understanding and action prediction across desktop, mobile, and web interfaces, and is used for downstream computer-use agent research.

Research 445 stars Apache-2.0
#research #gui-agent #computer-use #vision-language-model #benchmark

Google DeepMind research prototype for a universal multimodal AI agent that can see, hear, remember context, and respond in real time through phone and glasses-style experiences.

Research
#research #google #multimodal #real-time #autonomous-agent

Altera.AL's large-scale multi-agent simulation project in Minecraft. Project Sid studies emergent social behavior from many concurrent AI agents, including cooperation, organization, and simulated community dynamics.

Research 1297 stars
#research #multi-agent #simulation #open-source #autonomous-agent

A research assistant that uses AI to analyze citation statements and help researchers better discover, evaluate, and understand research. Scite Assistant employs deep learning techniques to extract and classify citations based on their intent, supporting comprehensive literature reviews and combating reproducibility challenges.

Framework

An AI-powered research tool that integrates with Clarivate's Academic AI Platform to provide literature review assistance, research analytics, and metadata analysis. The platform uses curated academic data and serves over 3,000 institutions with AI agents designed for academic workflows.

Framework

Realistic benchmark and environment for evaluating autonomous web agents on reproducible tasks across self-hosted web applications. WebArena is widely used for measuring multi-step web task completion.

Research 1469 stars Apache-2.0
#research #benchmark #web-agent #evaluation #browser-automation

OpenAI research system that fine-tuned GPT-3 to answer long-form questions by browsing the web and citing sources. WebGPT is a foundational browser-assisted question-answering agent that influenced later web-enabled LLM systems.

Research
#research #web-agent #browser #openai #rlhf

ServiceNow research benchmark for testing web agents on enterprise knowledge-work tasks. WorkArena evaluates agents across ServiceNow workflows such as incidents, tasks, knowledge bases, and business process navigation.

Research 249 stars
#research #benchmark #web-agent #evaluation #enterprise