kashif

Principal Research Scientist working on Deep Learning, Time Series Forecasting, Reinforcement Learning and HPC.

Berlin, Germany

kashif's Stars

argmaxinc/WhisperKit
Swift native on-device speech recognition with Whisper for Apple Silicon
Language:Swift2.5k 26 90211
huggingface/text-embeddings-inference
A blazing fast inference solution for text embeddings models
Language:Rust2.1k 27 174124
amazon-science/chronos-forecasting
Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting
Language:Python1.9k 21 41226
FinanceData/FinanceDataReader
Financial data reader
Language:Jupyter Notebook1.1k 64 179354
jmtomczak/intro_dgm
"Deep Generative Modeling": Introductory Examples
Language:Jupyter Notebook913 26 4155
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
Language:Python897 41 5877
stanfordnlp/pyreft
ReFT: Representation Finetuning for Language Models
Language:Python884 13 5566
uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
Language:Python857 11 2670
Efficient-Large-Model/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Language:Python805 19 6252
urchade/GLiNER
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
Language:Python771 10 6963
SalesforceAIResearch/uni2ts
Unified Training of Universal Time Series Forecasting Transformers
Language:Jupyter Notebook555 7 3946
facebookresearch/generative-recommenders
Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152, ICML'24).
Language:Python399 22 1965
xfactlab/orpo
Official repository for ORPO
Language:Python355 8 2233
allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
Language:Python238 4 4225
louaaron/Score-Entropy-Discrete-Diffusion
[ICML 2024 Oral] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
Language:Python209 4 720
spcl/QuaRot
Code for QuaRot, an end-to-end 4-bit inference of large language models.
Language:Python163 12 1710
AmeenAli/HiddenMambaAttn
Official PyTorch Implementation of "The Hidden Attention of Mamba Models"
Language:Python161 4 89
SalesforceAIResearch/DiffusionDPO
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
Language:Python137 5 1114
ESA-PhiLab/Major-TOM
Expandable Datasets for Earth Observation
Language:Jupyter Notebook121 11 56
felipemaiapolo/tinyBenchmarks
Evaluating LLMs with fewer examples
Language:Jupyter Notebook102 3 710
robertvacareanu/llm4regression
Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their context, without any parameter update
Language:Python98 3 313
apple/ml-4m
4M: Massively Multimodal Masked Modeling (NeurIPS 2023 Spotlight)
Language:Python963
vwxyzjn/summarize_from_feedback_details
Language:Python85 4 09
zhaoyu-li/DL4TP
A Survey on Deep Learning for Theorem Proving
67 3 03
google/codex
Data compression in JAX
Language:Python48 4 26
Asap7772/understanding-rlhf
Language:Python17 1 13
ZhaolinGao/REBEL
Language:Python151
fbarez/Interpreting-Context-Look-ups
Language:Jupyter Notebook60
hohe12ly/lag-llama
Language:Python10
Shawn-Guo-CN/Alignment_with_Huggingface
Language:Python10

kashif

kashif's Stars

argmaxinc/WhisperKit

huggingface/text-embeddings-inference

amazon-science/chronos-forecasting

FinanceData/FinanceDataReader

jmtomczak/intro_dgm

huggingface/nanotron

stanfordnlp/pyreft

uclaml/SPIN

Efficient-Large-Model/VILA

urchade/GLiNER

SalesforceAIResearch/uni2ts

facebookresearch/generative-recommenders

xfactlab/orpo

allenai/reward-bench

louaaron/Score-Entropy-Discrete-Diffusion

spcl/QuaRot

AmeenAli/HiddenMambaAttn

SalesforceAIResearch/DiffusionDPO

ESA-PhiLab/Major-TOM

felipemaiapolo/tinyBenchmarks

robertvacareanu/llm4regression

apple/ml-4m

vwxyzjn/summarize_from_feedback_details

zhaoyu-li/DL4TP

google/codex

Asap7772/understanding-rlhf

ZhaolinGao/REBEL

fbarez/Interpreting-Context-Look-ups

hohe12ly/lag-llama

Shawn-Guo-CN/Alignment_with_Huggingface