aashiqmuhamed
AI Researcher at CMU School of Computer Science | Applied Scientist at AWS AI | Stanford MS
Carnegie Mellon University, Pittsburgh
aashiqmuhamed's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
yoheinakajima/babyagi
stitionai/devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI. [⚠️ DEVIKA DOES NOT HAVE AN OFFICIAL WEBSITE ⚠️]
aiwaves-cn/agents
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
OpenBMB/AgentVerse
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
confident-ai/deepeval
The LLM Evaluation Framework
comet-ml/opik
From RAG chatbots to code assistants to complex agentic pipelines and beyond, build LLM systems that run better, faster, and cheaper with tracing, evaluations, and dashboards.
openai/simple-evals
facebookresearch/large_concept_model
Large Concept Models: Language modeling in a sentence representation space
valeman/Awesome_Math_Books
srush/awesome-o1
A bibliography and survey of the papers surrounding o1
HumanSignal/Adala
Adala: Autonomous DAta (Labeling) Agent framework
acl-org/acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
huggingface/picotron
Minimalistic 4D-parallelism distributed training framework for educational purposes
haizelabs/llama3-jailbreak
A trivial programmatic Llama 3 jailbreak. Sorry Zuck!
composable-models/llm_multiagent_debate
ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate
davidjurgens/potato
potato: portable text annotation tool
yixuantt/MultiHop-RAG
Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024)
google-deepmind/onetwo
Infini-AI-Lab/MagicPIG
MagicPIG: LSH Sampling for Efficient LLM Generation
OSU-NLP-Group/AgentSafety
ApolloResearch/e2e_sae
Sparse Autoencoder Training Library
primeqa/clapnq
adamkarvonen/SAEBench
awslabs/rag-qa-arena
kalyaniuniversity/MC4
An implementation of Markov Chain Type 4 Rank Aggregation algorithm in Python
NanshineLoong/Self-Evolving-Benchmark
A framework for evolving and testing question-answering datasets with various models.
THU-KEG/KNOT
aghyad-deeb/unlearning_evaluation
Code for the paper "Do Unlearning Methods Remove Information from Language Model Weights?"