sukuya
A dumb scientist trying to make intelligent machines. A non-linear thinker.
Rakuten Group Inc., Singapore
sukuya's Stars
guidance-ai/guidance
A guidance language for controlling large language models.
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Models
imthenachoman/How-To-Secure-A-Linux-Server
An evolving how-to guide for securing a Linux server.
smol-ai/developer
the first library to let you embed a developer agent in your own app!
huggingface/trl
Train transformer language models with reinforcement learning.
togethercomputer/OpenChatKit
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
Lightning-AI/lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
ray-project/llm-numbers
Numbers every LLM developer should know
NVIDIA/NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
XueFuzhao/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
fangwei123456/spikingjelly
SpikingJelly is an open-source deep learning framework for Spiking Neural Network (SNN) based on PyTorch.
jeshraghian/snntorch
Deep and online learning with spiking neural networks in Python
bgavran/Category_Theory_Machine_Learning
List of papers studying machine learning through the lens of category theory
Picovoice/picovoice
On-device voice assistant platform powered by deep learning
lucidrains/CoLT5-attention
Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch
bgavran/Category_Theory_Resources
List of resources for learning Category Theory
ZNLP/BigTranslate
BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages
intel/ideep
Intel® Optimization for Chainer*, a Chainer module providing a NumPy-like API and DNN acceleration using MKL-DNN.
VinAIResearch/VinAI_Translate
A Vietnamese-English Neural Machine Translation System (INTERSPEECH 2022)
google-research/mt-metrics-eval
Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.
acfr/RobustNeuralNetworks.jl
A Julia package for robust neural networks.
VinAIResearch/PhoMT
PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)
benediktahrens/CT4P
bitextor/monotextor
fufeisi/Usage-of-the-8bit-Quantization-in-Neural-Network-Training
This repo has the script to reproduce the experiments in project 'Usage of the 8bit Quantization in Neural Network Training'.
alvations/howtos
Many ask the why; I want to know the howtos
rakutentech/pisah
Sentence Splitter Library (C++ port of pySBD)
waferai/playground
This is the waferai playground repository, where the team tests different models.