Pinned Repositories
alexa-teacher-models
auto-cot
Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)
bigdetection
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
chronos-forecasting
Chronos: Pretrained Models for Time Series Forecasting
earth-forecasting-transformer
Official implementation of Earthformer
mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
patchcore-inspection
RAGChecker
RAGChecker: A Fine-grained Framework For Diagnosing RAG
RefChecker
RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
siam-mot
SiamMOT: Siamese Multi-Object Tracking
Amazon Science's Repositories
amazon-science/chronos-forecasting
Chronos: Pretrained Models for Time Series Forecasting
amazon-science/JuLS
JuLS is a Julia Local Search solver that combines Constraint Based Local Search (CBLS) and Constraint Programming (CP)
amazon-science/SWE-PolyBench
SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents
amazon-science/carbon-assessment-with-ml
CaML: Carbon Footprinting of Household Products with Zero-Shot Semantic Text Similarity
amazon-science/Cyber-Zero
Cyber-Zero: Training Cybersecurity Agents Without Runtime
amazon-science/BartGraphSumm
Implementation of the paper "Efficiently Summarizing Text and Graph Encodings of Multi-Document Clusters (NAACL 2021)"
amazon-science/Automatic-Table-to-Graph-Generation
amazon-science/LatticeAlgorithms.jl
Algorithms to solve lattice problems in Julia
amazon-science/MILE
amazon-science/dstc12-controllable-conversational-theme-detection
Data & code for DSTC12, Controllable Conversational Theme Detection track
amazon-science/MigrationBench
amazon-science/sumren
amazon-science/fmcore
Running GenAI models at every scale, on every modality
amazon-science/long-context-hallucination-detection
amazon-science/causal-validation
Validate your causal models!
amazon-science/wraval
WRAVAL helps in evaluating LLMs for writing assistant tasks like summarization, professional tone, witty tone, etc.
amazon-science/BOPRO-ICLR-2025
amazon-science/collage
amazon-science/Generative-vs-Discriminative-Classifiers
amazon-science/nowcasting-recession-risk
amazon-science/CTF-Dojo
Training Language Model Agents to Find Vulnerabilities with CTF-Dojo
amazon-science/MemInsight
amazon-science/QualityFlow
amazon-science/serializeEM
amazon-science/FairGen
amazon-science/application-eval-data
amazon-science/DiverseAgentEntropy
amazon-science/LARCQ
Codes of LARCQ Paper (Interspeech 2025)
amazon-science/MDSEval
amazon-science/weak-supervision-for-few-shot-absa