Pinned Repositories
tensorflow
flash-attention
Fast and memory-efficient exact attention
Olive
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
sparseml
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
tensorflow
tensorflow-xla-whl
For Ubuntu 18.04, CUDA 11.4, cuDNN 8, TensorFlow 2.8
polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
TempBalance
[NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training
nicolas-mng's Repositories
nicolas-mng/tensorflow
nicolas-mng/tensorflow-xla-whl
For Ubuntu 18.04, CUDA 11.4, cuDNN 8, TensorFlow 2.8