Pinned Repositories
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
flake8-coding
gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
llm_experiments
malfet.github.io
My web experiments
Mandelbrot
oneDNN
oneAPI Deep Neural Network Library (oneDNN)
PeachPy
x86-64 assembler embedded in Python
pocketfft
Clone of https://gitlab.mpcdf.mpg.de/mtr/pocketfft
malfet's Repositories
malfet/Mandelbrot
malfet/pytext
A natural language modeling framework based on PyTorch
malfet/thrust
Thrust is a parallel algorithms library which resembles the C++ Standard Template Library (STL).
malfet/bloaty
Bloaty McBloatface: a size profiler for binaries
malfet/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
malfet/GameOfLife-gtc2015
malfet/gloo
Collective communications library with various primitives for multi-machine training.
malfet/onnx
Open standard for machine learning interoperability
malfet/opencv
Open Source Computer Vision Library
malfet/opencv_contrib
Repository for OpenCV's extra modules
malfet/pybind11
Seamless operability between C++11 and Python
malfet/pytorch-ci-hud
Better front page for ci.pytorch.org than Jenkins provides
malfet/qemu
Official QEMU mirror. Please see http://wiki.qemu.org/Contribute/SubmitAPatch for how to submit changes to QEMU. Pull Requests are ignored. Please only use release tarballs from the QEMU website.
malfet/sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
malfet/tensorflow
Open source software library for numerical computation using data flow graphs.
malfet/tutorials
PyTorch tutorials.
malfet/vision
Datasets, Transforms and Models specific to Computer Vision
malfet/xla
Enabling PyTorch on Google TPU