Pinned Repositories
npugpt
NPU Optimized Inference of OpenAI's GPT-2 for Copilot PC with Windows and Qualcomm Snapdragon X ARM-based processor with Neural Processing Unit (NPU) for AI Acceleration
amplitude-python
Python API for Amplitude Analytics Logging - https://amplitude.com
keras2ios
Run Keras Deep Learning Models (basic case) with CoreMLTools for iOS 11
ParallelComputing-Swift-Metal
Perform Parallel Computing using Metal (objective C) and Swift
SwiftMetalForOSX
Swift and Metal example for GPU Processing on Apple OSX El Capitan 10.11 (and newer)
SwiftMetalGPUParallelProcessing
Data Parallel Processing with Swift and Metal on GPU for iOS8 (and beyond)
TensorMill
Synthetic weight generation for LLMs (e.g. GPT-OSS) for development (e.g. CI/CD)
torch_grokking
Grokking Transformer with Pytorch deep learning (AI) framework
DeepLearningKit
Open Source Deep Learning Framework for Apple's iOS, OS X and tvOS -
DeepLearningBibliography
Bibliography for Publications about Deep Learning using GPU
atveit's Repositories
atveit/torch_grokking
Grokking Transformer with Pytorch deep learning (AI) framework
atveit/jax_grokking
Grokking Transformer in Jax and Flax deep learning (AI) frameworks
atveit/TensorMill
Synthetic weight generation for LLMs (e.g. GPT-OSS) for development (e.g. CI/CD)
atveit/Agents-M365Copilot
Copilot Control Systems SDKs
atveit/AIDemoUI
Simple web UI for demonstrating AI (LLM) apps.
atveit/amplifier
atveit/bitsandbytes
8-bit CUDA functions for PyTorch
atveit/codex-universal
Base docker image used in Codex environments
atveit/dev-gpt-oss
atveit/DiskANN
Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
atveit/dllm-rl
Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models
atveit/gpt-oss
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
atveit/GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
atveit/HunyuanWorld-1.0
Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model
atveit/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
atveit/jax-tunix-grokking
Jax Tunix Grokking
atveit/llm.c
LLM training in simple, raw C/CUDA
atveit/megadlms
GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 training.
atveit/Megakernels
kernels, of the mega variety
atveit/promptius
promptius - simple prompt engineering evaluation tool
atveit/repo2prompt
Rust library that creates a prompt in chosen format (e.g. xml, json, text) with (code) content from a repository directory
atveit/ROCm
AMD ROCm™ Software - GitHub Home
atveit/semantic-kernel
Integrate cutting-edge LLM technology quickly and easily into your apps
atveit/thoughtbubbles
atveit/tiny-diffusion
atveit/tiny-reasoning-language-model
Code repository dedicated to experimenting and research with tiny reasoning language model
atveit/TinyRecursiveModels
atveit/tinyworlds
A minimal implementation of DeepMind's Genie world model
atveit/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
atveit/tunix
A JAX-native LLM Post-Training Library