Pinned Repositories
LongLM
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
cam
cache merge tech reference
chinese-poetry
最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
Computer-Archetecture
A Course in 2018 Spring
CSCE-633-WA3
CSrankings
A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.
data-centric-AI
A curated, but incomplete, list of data-centric AI resources.
LLM_Extension
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
Mooler0410's Repositories
Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
Mooler0410/data-centric-AI
A curated, but incomplete, list of data-centric AI resources.
Mooler0410/chinese-poetry
最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
Mooler0410/Computer-Archetecture
A Course in 2018 Spring
Mooler0410/CSrankings
A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.
Mooler0410/LLM_Extension
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Mooler0410/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
Mooler0410/cam
cache merge tech reference
Mooler0410/CSCE-633-WA3
Mooler0410/csce642-deepRL
Assignments of CSCE-642: Deep Reinforcement Learning offered at Texas A&M University.
Mooler0410/entangled-watermark
Mooler0410/GDCF
Sigir Paper
Mooler0410/gnn-model-explainer
gnn explainer
Mooler0410/HOMER
Official implementation of Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs (ICLR 2024).
Mooler0410/hw-acceptance-unit-test-cycle
Mooler0410/hw-bdd-cucumber
Mooler0410/hw-rails-intro
Mooler0410/hw-ruby-intro
Ruby Introduction Assignment for Agile Development using Ruby on Rails
Mooler0410/ICLR2022-OpenReviewData
Crawl & visualize ICLR papers and reviews
Mooler0410/IMMagician
Mooler0410/know-dont-tell
reference for layer-wise operation
Mooler0410/lgc_for689
Mooler0410/ml-superposition-prompting
Mooler0410/new_metric_for_demographic_parity
[TMLR] Retiring $\Delta \text{DP}$: New Distribution-Level Metrics for Demographic Parity
Mooler0410/PPCompress
Adapting Language Models to Compress Long Contexts
Mooler0410/puguJin
Personal Page
Mooler0410/rottenpotatoes-rails-intro
RottenPotatoes app skeleton for saasbook/hw-rails-intro
Mooler0410/Twitter_misinfo
Mooler0410/world-models
LLMs-Fancy-Viz