yoshitomo-matsubara/torchdistill
A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆 25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc. are implemented so far. 🎁 Trained models, training logs, and configurations are available to ensure reproducibility and benchmarking.
Python · MIT license
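The "coding-free" workflow is driven by YAML configuration files, but the library also exposes a small Python API for capturing intermediate representations. The sketch below is adapted from the repository's forward-hook example and shows `ForwardHookManager` hooking an unmodified torchvision model; the chosen module paths (`conv1`, `fc`) and batch shape are illustrative, and the import path assumes a recent torchdistill release.

```python
import torch
from torchvision import models
from torchdistill.core.forward_hook import ForwardHookManager

device = torch.device('cpu')
forward_hook_manager = ForwardHookManager(device)

# Plain torchvision model; no modification of its code is required
model = models.resnet18()
forward_hook_manager.add_hook(model, 'conv1', requires_input=True, requires_output=False)
forward_hook_manager.add_hook(model, 'fc', requires_input=False, requires_output=True)

# One forward pass populates the hook manager's I/O dictionary
x = torch.rand(16, 3, 224, 224)
y = model(x)

io_dict = forward_hook_manager.pop_io_dict()
conv1_input = io_dict['conv1']['input']  # input tensor captured at conv1
fc_output = io_dict['fc']['output']      # output tensor captured at fc
```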
Issues
get an error
#481 opened by cxchen100 - 1
Hi @cxchen100 ,
#482 opened by cxchen100 - 4
Possible re-implementation of KD w/ LS
#478 opened by sunshangquan - 1
Knowledge distillation related
#476 opened by andynnnnn - 2
[BUG] Problems in DistillationBox
#471 opened by 1396066796 - 2
Incorrect conditional judgement
#467 opened by 1396066796 - 1
Does torchdistill support knowledge distillation for Vision Foundation Models like Grounding Dino / Grounding DinoSAM?
#427 opened by solomonmanuelraj - 2
[BUG] ImportError: cannot import name 'import_dependencies' from 'torchdistill.common.main_util'
#398 opened by zhangruju - 1
[BUG] Missing Link in Readme
#389 opened by m-parchami - 4
[BUG] fp16 causes AssertionError: No inf checks were recorded for this optimizer
#386 opened by jsrdcht - 1
I tried with this script as well; only a single nproc seems to work. Do I need to define any additional environment variables like RANK or LOCAL HOST?
#379 opened by nighting0le01 - 1
[BUG] Not supported on Nvidia 4090
#367 opened by allent4n - 1
How should I use Torchdistill?
#365 opened by 2842193395 - 1
Not a bug, but a discrepancy between the log and config file for kd-resnet18_from_resnet34
#279 opened by Coderx7 - 1
Where is the trained model?
#259 opened by Holmes2002 - 1
Custom Data
#255 opened by Holmes2002 - 1
Use different models as Teacher/Student
#246 opened by jaideep11061982 - 1
Disagreement between the log and configuration of kd-resnet18_from_resnet34
#237 opened by Calmepro777 - 1
Why use `log_softmax` instead of `softmax`?
#233 opened by nguyenvulong - 1
How to train my own COCO dataset for object detection?
#222 opened by Muke6 - 2
Similarity Preserving KD
#221 opened by hanoonaR - 3
Combine two distillation losses
#215 opened by shwinshaker - 24
If the Teacher model is different from the Student model, how can I use this framework?
#133 opened by topbookcc - 2
Bug: bad implementation.
#214 opened by aiboys - 3
Distilling Knowledge from an image classification model with sigmoid function and binary cross entropy
#211 opened by publioelon - 1
There seems to be a bug in `split_dataset`
#209 opened by aiboys - 2
Affinity Loss usage
#181 opened by ChidanandKumarKS - 5
Hyperparameter tuning
#154 opened by AhmedHussKhalifa - 2
Using forward hook for auxiliary loss
#152 opened by 1ho0jin1 - 7
Implementation of SemCKD
#140 opened by AndyFrancesco29 - 3
Support for SSD Object Detection Model?
#117 opened by anujdutt9 - 15
CSE-L2 KD mobilenetv2 from resnet18 on cifar100
#131 opened by wongyufei - 1
How to specify the weights of the downloaded teacher model without access to the Internet?
#121 opened by wongyufei - 2
AssertionError: DistributedDataParallel is not needed when a module doesn't have any parameter that requires a gradient.
#122 opened by wongyufei - 5
Segmentation fault encountered when entering the second epoch with num_workers>0
#90 opened by RulinShao - 6
ForwardHookManager on multiple GPUs
#32 opened by arbellea - 8
About the Dockerfile of torchdistill
#36 opened by lliai - 2