Hugginface Reading Group

Welcome to the Huggingface Reading Group! The goal of this group is to have a weekly presentation on research papers/groups of papers. The goal of this repository is to compile all the past presentation write-ups and recordings.

Brief History

This group was started by Huggingface community member James Kelly on 09/26/2023. In the beginning, we "presented" via a summary of papers in discord threads but we started 1/12/2024 to do presentations in discord calls thanks to Phil Butler. The presentations, in general, are targetted for the general audience on the subject of Generative Models but no research papers are off limits.

0: Ambiguity-Aware In-Context Learning with Large Language Models(Presented on 9/27/2023)

Presenter: James Kelly

Paper: Ambiguity-Aware In-Context Learning with Large Language Models

Discord Thread

1: Controlling Neural Networks with Rule Representations(Presented on 10/05/2023)

Presenter: James Kelly

Paper: Controlling Neural Networks with Rule Representations (NeurIPs, 2021)

Code

Discord Thread

2: Understanding Instaflow/Rectified Flow(Presented on 10/11/2023)

Presenter: Isamu Isozaki

Paper: InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation

Write up

Discord Thread

3: Mysteries of Text Embeddings(Presented on 10/19/2023)

Presenter: Isamu Isozaki

Papers: Text Embeddings Reveal (Almost) As Much As Text+NEFTune: Noisy Embeddings Improve Instruction Finetuning

Discord Thread

4: Training Image Derivatives: Increased Accuracy and Universal Robustness(Presented on 11/08/2023)

Presenter: Vsevolod I. Avrutskiy. Author of the paper

Paper: Training Image Derivatives: Increased Accuracy and Universal Robustness

Discord Thread

5: Understanding Zephyr(Presented on 11/16/2023)

Presenter: Isamu Isozaki

Paper: Zephyr: Direct Distillation of LM Alignment

Write up

Discord Thread

6: Literature Review on RAG(Retrieval Augmented Generation) for Custom Domains(Presented on 11/29/2023)

Presenter: Isamu Isozaki

Papers: Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks + Improving the Domain Adaptation of Retrieval Augmented Generation (RAG) Models for Open Domain Question Answering + RA-DIT: Retrieval-Augmented Dual Instruction Tuning

Write up

Discord Thread

7: Understanding MagVIT2: Language Model Beats Diffusion: Tokenizer is key to visual generation(Presented on 12/13/2023)

Presenter: Isamu Isozaki

Paper: Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

Write up

Discord Thread

8: Understanding Common Diffusion Noise Schedules and Sample Steps are Flawed(Presented on 12/21/2023)

Presenter: Isamu Isozaki

Paper: Common Diffusion Noise Schedules and Sample Steps are Flawed

Write up

Discord Thread

9: The Tyranny of Possibilities in the Design of Task-Oriented LLM Systems: A Scoping Survey(Presented on 1/5/2024)

Presenter: Dhruv Dhamani. Author of the paper

Paper: The Tyranny of Possibilities in the Design of Task-Oriented LLM Systems: A Scoping Survey

Discord Thread

10: Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation(Presented on 1/12/2024)

Presenter: Phil Butler

Paper: Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Write up

Unfortunately, no recordings but a coauthors came.

11: Literature Review on AI in Law(Presented on 2/2/2024)

Presenter: Isamu Isozaki

Papers: On the acceptability of arguments and its fundamental role in non-monotonic reasoning, logic programming, and n-person games+An Answer Set Programming Approach to Argumentative Reasoning in the ASPIC+ Framework+HYPO’s legacy: introduction to the virtual special issue+Induction of Defeasible Logic Theories in the Legal Domain+Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset+Large Language Models in Law: A Survey+The Smart Court - A New Pathway to Justice in China?

isamu-isozaki/huggingface-reading-group

Hugginface Reading Group

Brief History

0: Ambiguity-Aware In-Context Learning with Large Language Models(Presented on 9/27/2023)

1: Controlling Neural Networks with Rule Representations(Presented on 10/05/2023)

2: Understanding Instaflow/Rectified Flow(Presented on 10/11/2023)

3: Mysteries of Text Embeddings(Presented on 10/19/2023)

4: Training Image Derivatives: Increased Accuracy and Universal Robustness(Presented on 11/08/2023)

5: Understanding Zephyr(Presented on 11/16/2023)

6: Literature Review on RAG(Retrieval Augmented Generation) for Custom Domains(Presented on 11/29/2023)

7: Understanding MagVIT2: Language Model Beats Diffusion: Tokenizer is key to visual generation(Presented on 12/13/2023)

8: Understanding Common Diffusion Noise Schedules and Sample Steps are Flawed(Presented on 12/21/2023)

9: The Tyranny of Possibilities in the Design of Task-Oriented LLM Systems: A Scoping Survey(Presented on 1/5/2024)

10: Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation(Presented on 1/12/2024)

11: Literature Review on AI in Law(Presented on 2/2/2024)

12: A forthcoming decoder-only foundation model for time-series forecasting & further research(Presented on 2/9/2024)

13: Mamba: Linear-Time Sequence Modeling with Selective State Spaces

14: Neural Circuit Diagrams: Robust Diagrams for the Communication, Implementation, and Analysis of Deep Learning Architectures

15: SOTA on Model Merging

16: Gemini 1.5 Pro: Unlock reasoning and knowledge from entire books and movies in a single prompt

17: HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction

18: ProteinBERT: A universal deep-learning model of protein sequence and function

19: Just Say the Name: Online Continual Learning with Category Names Only via Data Generation

20: Graph Machine Learning in the Era of Large Language Models (LLMs)

21: Story Generation with AI

22: AlphaFold 3

23: AI for Physics. Hamilton Neural Networks/Lagrangian Neural Networks

24: Understanding Current State of Reasoning with LLMs

25: Multimodal Structured Generation & CVPR’s 2nd MMFM Challenge

26: SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound

27: Understanding Penetration Testing with LLMs