Pinned Repositories
alexa-teacher-models
auto-cot
Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)
bigdetection
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
chronos-forecasting
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
earth-forecasting-transformer
Official implementation of Earthformer
mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
patchcore-inspection
RAGChecker
RAGChecker: A Fine-grained Framework For Diagnosing RAG
RefChecker
RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
siam-mot
SiamMOT: Siamese Multi-Object Tracking
Amazon Science's Repositories
amazon-science/chronos-forecasting
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
amazon-science/cceval
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)
amazon-science/JuLS
JuLS is a Julia Local Search solver that combines Constraint Based Local Search (CBLS) and Constraint Programming (CP)
amazon-science/object-centric-learning-framework
amazon-science/SWE-PolyBench
SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents
amazon-science/carbon-assessment-with-ml
CaML: Carbon Footprinting of Household Products with Zero-Shot Semantic Text Similarity
amazon-science/BartGraphSumm
Implementation of the paper "Efficiently Summarizing Text and Graph Encodings of Multi-Document Clusters (NAACL 2021)"
amazon-science/Cyber-Zero
Cyber-Zero: Training Cybersecurity Agents Without Runtime
amazon-science/cocomic
CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context
amazon-science/generalized-fairness-metrics
amazon-science/llm-asymptotic-decoding
amazon-science/Automatic-Table-to-Graph-Generation
amazon-science/dstc12-controllable-conversational-theme-detection
Data & code for DSTC12, Controllable Conversational Theme Detection track
amazon-science/h3-indexer
The h3-indexer is an open source package for indexing geospatial data using PySpark, Apache Sedona and the H3 hierarchical spatial indexing system. The h3-indexer maps any number of vector-type geospatial data sets to H3 grids for efficient spatial analysis and querying.
amazon-science/LatticeAlgorithms.jl
Algorithms to solve lattice problems in Julia
amazon-science/MigrationBench
amazon-science/fair-pca
amazon-science/fmcore
Running GenAI models at every scale, on every modality
amazon-science/long-context-hallucination-detection
amazon-science/wraval
WRAVAL helps in evaluating LLMs for writing assistant tasks like summarization, professional tone, witty tone, etc.
amazon-science/BOPRO-ICLR-2025
amazon-science/collage
amazon-science/supervised-intent-clustering
This is a package to fine-tune language models in order to create clustering-friendly embeddings.
amazon-science/MemInsight
amazon-science/QualityFlow
amazon-science/serializeEM
amazon-science/application-eval-data
amazon-science/LARCQ
Codes of LARCQ Paper (Interspeech 2025)
amazon-science/plan-guided-summarization
amazon-science/weak-supervision-for-few-shot-absa