thtang

Algorithm engineer

ShopeeSingapore

thtang's Stars

hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language:Python34.9k 212 5.3k4.3k
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Language:Jupyter Notebook33.6k 360 1064.1k
LlamaFamily/Llama-Chinese
Llama中文社区，Llama3在线体验和微调模型已开放，实时汇总最新Llama3学习资料，已将所有代码更新适配Llama3，构建最好的中文Llama大模型，完全开源可商用
Language:Python14.1k 148 3371.3k
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook13.8k 98 181.1k
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language:Python12.7k 106 589891
dottxt-ai/outlines
Structured Text Generation
Language:Python9.7k 47 630498
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
Language:Jupyter Notebook9.5k 141 4521.5k
DA-southampton/NLP_ability
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识，包括面试题，各种基础知识，工程能力等等，提升核心竞争力
Language:Python6.9k 105 51.2k
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python6.1k 52 625475
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Language:Python5.1k 49 451385
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Language:Python2.1k 46 129150
microsoft/BlingFire
A lightning fast Finite State machine and REgular expression manipulation library.
Language:C++1.8k 33 68129
noamgat/lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model
Language:Python1.6k 13 11570
RUC-NLPIR/FlashRAG
⚡FlashRAG: A Python Toolkit for Efficient RAG Research
Language:Python1.4k 12 86111
McGill-NLP/llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Language:Python1.3k 22 11996
wangyuxinwhy/uniem
unified embedding model
Language:Python834 8 10665
quqxui/Awesome-LLM4IE-Papers
Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)
768 12 042
ContextualAI/gritlm
Generative Representational Instruction Tuning
Language:Jupyter Notebook570 9 5341
texttron/tevatron
Tevatron - A flexible toolkit for neural retrieval research and development.
Language:Python530 11 99100
sunnweiwei/RankGPT
Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]
Language:Python527 7 2150
SeanLee97/AnglE
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
Language:Python492 10 4933
rohan-paul/LLM-FineTuning-Large-Language-Models
LLM (Large Language Model) FineTuning
Language:Jupyter Notebook469 9 2112
castorini/rank_llm
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.
Language:Python357 10 4642
mlpc-ucsd/BLIVA
(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
Language:Python270 12 2728
chaoswork/llm_illustrated
看图学大模型
Language:Python195 7 012
facebookresearch/tart
Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.
Language:Python160 7 1211
jakespringer/echo-embeddings
Language:Python127 2 47
stanford-oval/wikidata-emnlp23
WikiSP, a semantic parser for Wikidata. WikiWebQuestions, a SPARQL-annotated dataset on Wikidata
Language:Python83 6 28
livingbio/fuzzy-json
Fuzzy-JSON is a compact Python package with no dependencies, designed to address the pesky JSONDecodeError that sometimes occurs when utilizing OpenAI's powerful call function.
Language:Python31 7 205
vegetablejuiceftw/wiki-search
Wikipedia / Wikidata search project for knowledge base RAG systems.
Language:Python30

thtang

thtang's Stars

hiyouga/LLaMA-Factory

rasbt/LLMs-from-scratch

LlamaFamily/Llama-Chinese

naklecha/llama3-from-scratch

OpenBMB/MiniCPM-V

dottxt-ai/outlines

NielsRogge/Transformers-Tutorials

DA-southampton/NLP_ability

OpenGVLab/InternVL

QwenLM/Qwen-VL

huggingface/datatrove

microsoft/BlingFire

noamgat/lm-format-enforcer

RUC-NLPIR/FlashRAG

McGill-NLP/llm2vec

wangyuxinwhy/uniem

quqxui/Awesome-LLM4IE-Papers

ContextualAI/gritlm

texttron/tevatron

sunnweiwei/RankGPT

SeanLee97/AnglE

rohan-paul/LLM-FineTuning-Large-Language-Models

castorini/rank_llm

mlpc-ucsd/BLIVA

chaoswork/llm_illustrated

facebookresearch/tart

jakespringer/echo-embeddings

stanford-oval/wikidata-emnlp23

livingbio/fuzzy-json

vegetablejuiceftw/wiki-search