Hiroki11x
PhD Candidate at Université de Montréal, Mila / Student Researcher at Google DeepMind / HPC, Deep Learning, LLM / ex-Tokyo Tech, Microsoft Research, IBM Research
Mila, Université de Montréal, Montreal, QC, Canada
Hiroki11x's Stars
jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
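A minimal sketch of the core operation this repo implements, the scaled dot-product attention from "Attention is All You Need"; the function name and tensor shapes below are illustrative, not the repo's API:

```python
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v, mask=None):
    # q, k, v: (batch, heads, seq_len, d_k)
    d_k = q.size(-1)
    # Similarity scores, scaled by sqrt(d_k) as in the paper
    scores = q @ k.transpose(-2, -1) / d_k ** 0.5
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float('-inf'))
    attn = F.softmax(scores, dim=-1)
    return attn @ v

q = k = v = torch.randn(2, 8, 10, 64)
out = scaled_dot_product_attention(q, k, v)  # (2, 8, 10, 64)
```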
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offers SOTA compression techniques for LLMs; runs LLMs efficiently on Intel platforms ⚡
lblaoke/EMCMC
PyTorch implementation of the paper "Entropy-MCMC: Sampling from Flat Basins with Ease" (ICLR 2024)
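For context, a sketch of plain stochastic gradient Langevin dynamics (SGLD), the kind of gradient-based sampler that flat-basin methods like Entropy-MCMC build on; this is the vanilla update, not the paper's algorithm, and the helper is hypothetical:

```python
import torch

def sgld_step(params, loss, lr=1e-4, temperature=1.0):
    """One SGLD step: w <- w - lr * grad + N(0, 2 * lr * T).
    Vanilla SGLD for illustration; Entropy-MCMC modifies the
    target distribution to favor flat basins."""
    grads = torch.autograd.grad(loss, params)
    with torch.no_grad():
        for p, g in zip(params, grads):
            noise = torch.randn_like(p) * (2.0 * lr * temperature) ** 0.5
            p.add_(-lr * g + noise)
```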
thuml/CLIPood
Code release for "CLIPood: Generalizing CLIP to Out-of-Distributions" (ICML 2023), https://arxiv.org/abs/2302.00864
ryoungj/optdom
[ICLR'22] Self-supervised learning of optimally robust representations for domain shift.
yorkerlin/StructuredNGD-DL
Matrix-multiplication-only KFAC; code for the ICML 2023 paper "Simplifying Momentum-based Positive-definite Submanifold Optimization with Applications to Deep Learning"
AiMl-hub/UPLM
Uncertainty-Guided Pseudo-Labelling with Model Averaging
garrettj403/SciencePlots
Matplotlib styles for scientific plotting
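Typical usage, per the package's documentation (SciencePlots >= 2.0, where importing the package registers the styles with matplotlib):

```python
import matplotlib.pyplot as plt
import scienceplots  # registers the 'science' family of styles

# Combine the base style with the IEEE variant;
# use ['science', 'no-latex'] if no LaTeX install is available.
plt.style.use(['science', 'ieee'])

fig, ax = plt.subplots()
ax.plot([0, 1, 2], [0, 1, 4], label='$y = x^2$')
ax.legend()
ax.set(xlabel='x', ylabel='y')
fig.savefig('example.pdf')
```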
ZZhangsm/OOKD
Revisiting Knowledge Distillation under Distribution Shift: https://arxiv.org/abs/2312.16242
MarlonBecker/MSAM
qiaoruiyt/NoiseRobustDG
hitachi-nlp/ensemble-metrics
lolapriego/coursework
Checklist of videos and exercises you can follow to build a solid baseline in Data Structures and Algorithms.
Nutlope/aicommits
A CLI that writes your git commit messages for you with AI
rethread-studio/algorithmic-art-course
Collection of resources for the algorithmic art course at the Université de Montréal
qingsongedu/Awesome-TimeSeries-SpatioTemporal-LM-LLM
A professional list on Large (Language) Models and Foundation Models (LLM, LM, FM) for Time Series, Spatiotemporal, and Event Data.
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
IntelLabs/academic-budget-bert
Repository containing code for "How to Train BERT with an Academic Budget" paper
sebastianbergner/ExploringCLFD
Code for the paper "Exploring the Properties of Hypernetworks for Continual Learning in Robotics".
adityatelange/hugo-PaperMod
A fast, clean, responsive Hugo theme.
TrustAIoT/CR-SAM
Code for AAAI 2024 paper: CR-SAM: Curvature Regularized Sharpness-Aware Minimization
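For context, a minimal sketch of the vanilla SAM update that CR-SAM adds curvature regularization to; this is plain sharpness-aware minimization, not the paper's variant, and the helper below is hypothetical:

```python
import torch

def sam_step(model, loss_fn, x, y, base_optimizer, rho=0.05):
    # SAM: min_w max_{||e|| <= rho} L(w + e)
    # 1) Gradient at w, then ascent step e = rho * g / ||g||.
    loss = loss_fn(model(x), y)
    loss.backward()
    params = [p for p in model.parameters() if p.grad is not None]
    grad_norm = torch.norm(torch.stack([p.grad.norm(p=2) for p in params]), p=2)
    eps = []
    with torch.no_grad():
        for p in params:
            e = rho * p.grad / (grad_norm + 1e-12)
            p.add_(e)          # w <- w + e (worst-case perturbation)
            eps.append(e)
    model.zero_grad()

    # 2) Gradient at the perturbed point, applied at the original weights.
    loss_fn(model(x), y).backward()
    with torch.no_grad():
        for p, e in zip(params, eps):
            p.sub_(e)          # restore w
    base_optimizer.step()      # descend using the perturbed-point gradient
    base_optimizer.zero_grad()
    return loss.item()
```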
d-li14/mobilenetv2.pytorch
A MobileNetV2 1.0 model reaching 72.8% top-1 accuracy on ImageNet, plus a spectrum of pre-trained MobileNetV2 models
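This repo ships its own checkpoints; as a rough equivalent for quick experiments, torchvision's pretrained MobileNetV2 can be loaded like this (torchvision >= 0.13 API, not this repo's):

```python
import torch
from torchvision.models import mobilenet_v2, MobileNet_V2_Weights

# Load ImageNet-pretrained weights from torchvision
model = mobilenet_v2(weights=MobileNet_V2_Weights.IMAGENET1K_V1).eval()

x = torch.randn(1, 3, 224, 224)   # dummy ImageNet-sized input
with torch.no_grad():
    logits = model(x)             # (1, 1000) class scores
```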
llm-jp/modelwg
LLM-jp Model-WG Working Directory
openai/weak-to-strong
microsoft/mttl
Building modular LMs with parameter-efficient fine-tuning.
r-three/mats
filipbasara0/simple-convnext
Simple implementation of the ConvNext architecture in PyTorch
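A single ConvNeXt block following the published architecture (7x7 depthwise conv, channels-last LayerNorm, inverted bottleneck with GELU, layer scale, residual); a sketch of the design, not this repo's exact code:

```python
import torch
import torch.nn as nn

class ConvNeXtBlock(nn.Module):
    """7x7 depthwise conv -> LayerNorm -> 1x1 expand (4x) -> GELU
    -> 1x1 project -> layer scale -> residual connection."""
    def __init__(self, dim, layer_scale_init=1e-6):
        super().__init__()
        self.dwconv = nn.Conv2d(dim, dim, kernel_size=7, padding=3, groups=dim)
        self.norm = nn.LayerNorm(dim)            # applied channels-last
        self.pwconv1 = nn.Linear(dim, 4 * dim)   # pointwise convs as Linear
        self.act = nn.GELU()
        self.pwconv2 = nn.Linear(4 * dim, dim)
        self.gamma = nn.Parameter(layer_scale_init * torch.ones(dim))

    def forward(self, x):                        # x: (N, C, H, W)
        residual = x
        x = self.dwconv(x)
        x = x.permute(0, 2, 3, 1)                # to channels-last (N, H, W, C)
        x = self.pwconv2(self.act(self.pwconv1(self.norm(x))))
        x = self.gamma * x
        x = x.permute(0, 3, 1, 2)                # back to (N, C, H, W)
        return residual + x
```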
namkoong-lab/whyshift
A Python package providing a benchmark with various specified distribution shift patterns.
VirtuosoResearch/Robust-Fine-Tuning
Measuring the generalization properties of fine-tuning using the Hessian
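One common Hessian-based curvature probe is Hutchinson's estimator of the Hessian trace, computed from Hessian-vector products via double backward; a generic sketch of that technique, not necessarily this repo's exact measure:

```python
import torch

def hessian_trace(loss, params, n_samples=10):
    """Hutchinson estimator: E[v^T H v] = tr(H) for Rademacher v."""
    grads = torch.autograd.grad(loss, params, create_graph=True)
    estimate = 0.0
    for _ in range(n_samples):
        # Rademacher probe vectors with +/-1 entries
        vs = [torch.randint_like(p, high=2) * 2.0 - 1.0 for p in params]
        # Hessian-vector product: differentiate (g . v) w.r.t. the params
        hvs = torch.autograd.grad(grads, params, grad_outputs=vs, retain_graph=True)
        estimate += sum((v * hv).sum() for v, hv in zip(vs, hvs)).item()
    return estimate / n_samples

# Usage: loss = criterion(model(x), y); params with requires_grad=True
# tr_h = hessian_trace(loss, [p for p in model.parameters() if p.requires_grad])
```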
mlcommons/algorithmic-efficiency
MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvements in both training algorithms and models.