Spico197

PhD student at Soochow University.

Soochow UniversitySuzhou

Pinned Repositories

llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
Language:Python853 8 1944
awesome-lm-evaluation
🩺 A collection of ChatGPT evaluation reports on various bechmarks.
49 3 03
CatalogExtraction
🌳CED: Catalog Extraction from Documents
Language:Python15 2 51
DocEE
🕹️ A toolkit for document-level event extraction, containing some SOTA model implementations.
Language:Python232 6 8636
Humback
🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.
Language:Python129 3 98
Mirror
🪞A powerful toolkit for almost all the Information Extraction tasks.
Language:Python106 5 712
MoE-SFT
🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts
Language:Python33 1 00
paper-hero
💪 A toolkit to help search for papers from aclanthology, arXiv and dblp.
Language:Python42 2 04
random-luck
Automatically select the best random seed based on ancient Chinese I Ching. Good luck and best wishes !
Language:Python43 4 16
watchmen
😎 A simple and easy-to-use toolkit for GPU scheduling.
Language:Python40 2 14

Spico197's Repositories

Spico197/DocEE
🕹️ A toolkit for document-level event extraction, containing some SOTA model implementations.
Language:Python232 6 8636
Spico197/Humback
🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.
Language:Python129 3 98
Spico197/Mirror
🪞A powerful toolkit for almost all the Information Extraction tasks.
Language:Python106 5 712
Spico197/MoE-SFT
🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts
Language:Python33 1 00
Spico197/REx
🎮 A toolkit for Relation Extraction and more...
Language:Python23 3 203
Spico197/CatalogExtraction
🌳CED: Catalog Extraction from Documents
Language:Python15 2 51
Spico197/accept
Will my paper be accepted?
Language:HTML5 1 1
Spico197/feishu-alert-bots
Language:Python4 3 0
Spico197/smoe-eval
For smoe models evaluation. Commit: b281b0921b636bc36ad05c0b0b0763bd6dd43463
Language:Python4 2 01
Spico197/Spico197.github.io
ZHU Tong's homepage.
Language:JavaScript4 2 06
Spico197/TaskLAMA
🚩 An unoffical implementation of TaskLAMA.
Language:Python3 3 0
Spico197/TaskPlanningPapers
🥅 A collection of task planning papers.
3 2 0
Spico197/daily-arxiv
Daily arXiv feed
Language:Vue2 2 1
Spico197/awesome-idol-papers
1 0
Spico197/Awesome-Mixture-of-Experts-Papers
A curated reading list of research in Mixture-of-Experts(MoE).
1 0
Spico197/azure-openai
API testcase.
Language:Python2 0
Spico197/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python1 0
Spico197/graduate
Language:HTML1 0
Spico197/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
Language:Python1 0
Spico197/LLM-Terminologies
A collection of terminologies about LLM.
2 0
Spico197/mergekit
Tools for merging pretrained large language models.
Language:Python0 0
Spico197/MyArxiv
Daily arXiv feed
Language:CSS2 0
Spico197/nanoT5
Fast & Simple repository for pre-training and fine-tuning T5-style models
Language:Python1 0
Spico197/nanotron
Minimalistic large language model 3D-parallelism training
Language:Python0 0
Spico197/OpenBA
OpenBA: An Open-Sourced 15B Bilingual Asymmetric Seq2Seq Model Pre-trained from Scratch
Language:Python1 0
Spico197/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (LLaMA, LLaMa2, ChatGLM2, ChatGPT, Claude, etc) over 50+ datasets.
Language:Python1 01
Spico197/python-template
🗞️ A template for creating new Python projects.
Language:Makefile2 0
Spico197/Spico197
2 0
Spico197/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
Language:Python2 0
Spico197/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python1 0