nmjirving's Stars
AutoGPTQ/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
wandb/openui
OpenUI let's you describe UI using your imagination, then see it rendered live.
HKUDS/DCRec
[WWW'2023] "DCRec: Debiased Contrastive Learning for Sequential Recommendation"
RUCAIBox/CIKM2020-S3Rec
Code for CIKM2020 "S3-Rec: Self-Supervised Learning for Sequential Recommendation with Mutual Information Maximization"
RUCAIBox/FMLP-Rec
KevinMusgrave/pytorch-metric-learning
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
aksnzhy/xlearn
High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.
modelscope/dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.
AaronHeee/Query-SeqRec
PyTorch Implementation of Query-Aware Sequential Recommendation (CIKM'22)
JennyXieJiayi/UnifiedSSR
[WWW '24] UnifiedSSR: A Unified Framework of Sequential Search and Recommendation
Ethan00Si/SESREC-SIGIR-2023
The implementation of the SIGIR 2023 paper "When Search Meets Recommendation: Learning Disentangled Search Representation for Recommendation"
TengShi-RUC/UniSAR
The implementation of UniSAR
antklen/sasrec-bert4rec-recsys23
Code for ACM RecSys 2023 paper "Turning Dross Into Gold Loss: Is BERT4Rec really better than SASRec?"
OpenVLG/DELLA
Official code for the NAACL 2022 paper "Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text Generation"
kakaobrain/rq-vae-transformer
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
Caiyun-AI/DCFormer
KingGugu/Awesome-Contrastive-Learning-and-Data-Augmentation-RS-Paper-Code
The latest research progress of Contrastive Learning(CL), Data Augmentation(DA) and Self-Supervised Learning(SSL) in Recommender Systems
lhao499/language-quantized-autoencoders
Language Quantized AutoEncoders
vvvm23/vqvae-2
PyTorch implementation of VQ-VAE-2 from "Generating Diverse High-Fidelity Images with VQ-VAE-2"
unslothai/unsloth
Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
YangLing0818/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
TencentQQGYLab/ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
Tlntin/Qwen-TensorRT-LLM
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
RUCAIBox/LC-Rec
tyxsspa/AnyText
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>