cheikhfiteni's Stars
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
eevaain/tiny-tpu
A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1.
MengRao/SPMC_Queue
A simple and efficient single producer multiple consumer queue, suititable for both ITC and IPC.
rezabrizi/SPMC-Queue
Very fast single producer multiple consumer queue (thread safe)
cloneofsimo/scaling-guide
WIP
RayTracing/raytracing.github.io
Main Web Site (Online Books)
Rippling/suspend-time
A cross-platform monotonic clock that is suspend-unaware, written in Rust!
jackyzha0/quartz
🌱 a fast, batteries-included static-site generator that transforms Markdown content into fully functional websites
sainnhe/gruvbox-material
Gruvbox with Material Palette
basetenlabs/Workshop-TRT-LLM
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
aio-libs/async-lru
Simple LRU cache for asyncio
magicuidesign/magicui
UI Library for Design Engineers. Animated components and effects you can copy and paste into your apps. Free. Open Source.
shashwatak/satellite-js
Modular set of functions for SGP4 and SDP4 propagation of TLEs.
HigherOrderCO/Bend
A massively parallel, high-level programming language
sweepai/sweep
Sweep: open-source AI-powered Software Developer for small features and bug fixes.
PyGithub/PyGithub
Typed interactions with the GitHub API v3
joerick/pyinstrument
🚴 Call stack profiler for Python. Shows you why your code is slow!
trevorhobenshield/twitter-api-client
Implementation of X/Twitter v1, v2, and GraphQL APIs
bgstaal/multipleWindow3dScene
A quick example of how one can "synchronize" a 3d scene across multiple windows using three.js and localStorage
ronnachum11/synthesis
bokuweb/react-rnd
🖱 A resizable and draggable component for React.
hagopj13/node-express-boilerplate
A boilerplate for building production-ready RESTful APIs using Node.js, Express, and Mongoose
harvard-ml-courses/cs181-textbook
PKU-DAIR/Hetu
A high-performance distributed deep learning system targeting large-scale and automated distributed training.
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
justinchiu/openlogprobs
Extract full next-token probabilities via language model APIs
ggerganov/llama.cpp
LLM inference in C/C++
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.