Pinned Repositories
aviary
Gymnasium framework for training language model agents on constructive tasks
drugcrow
April 2024 Hackathon Crow Project
LAB-Bench
Evaluation dataset for AI systems intended to benchmark capabilities foundational to scientific research in biology
LitQA
LitQA Eval: A difficult set of scientific questions that require context from full-text research papers to answer
paper-qa
High-accuracy RAG for answering questions from scientific documents with citations
SWE-bench
Fork of the upstream SWE-bench repository
WikiCrow
FutureHouse's Repositories
Future-House/paper-qa
High-accuracy RAG for answering questions from scientific documents with citations
Future-House/WikiCrow
Future-House/LitQA
LitQA Eval: A difficult set of scientific questions that require context from full-text research papers to answer
Future-House/LAB-Bench
Evaluation dataset for AI systems intended to benchmark capabilities foundational to scientific research in biology
Future-House/aviary
Gymnasium framework for training language model agents on constructive tasks
Future-House/drugcrow
April 2024 Hackathon Crow Project
Future-House/ldp
Agent framework for constructing language model agents and training on constructive tasks
Future-House/SWE-bench
Fork of the upstream SWE-bench repository