Pinned Repositories
alexnet
AlexNet Implementation in PyTorch
arman-hk
arman-hk
BiGS
Bidirectional Gated State Space Models for NLP
clusterdata-prediction
An MLP neural network trained on cluster data collected from Alibaba production clusters, for cluster-management research
code-lamini
A demonstration of how to build a retrieval-augmented instruction model in Lamini
cot-unfaithfulness
fastbook
The fastai book, published as Jupyter Notebooks
mlp-mcr
Multi-layer Perceptron Regression on Microservice Call Rate
BiGS
Official repository of Pretraining Without Attention (BiGS). BiGS is the first model to achieve BERT-level transfer learning on the GLUE benchmark with subquadratic complexity in length (i.e., without attention).
Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
arman-hk's Repositories
arman-hk/clusterdata-prediction
An MLP neural network trained on cluster data collected from Alibaba production clusters, for cluster-management research
arman-hk/fastbook
The fastai book, published as Jupyter Notebooks
arman-hk/mlp-mcr
Multi-layer Perceptron Regression on Microservice Call Rate
arman-hk/alexnet
AlexNet Implementation in PyTorch
arman-hk/arman-hk
arman-hk
arman-hk/BiGS
Bidirectional Gated State Space Models for NLP
arman-hk/code-lamini
A demonstration of how to build a retrieval-augmented instruction model in Lamini
arman-hk/cot-unfaithfulness
arman-hk/data-structures-c
Implementing Data Structures in C
arman-hk/lamini
arman-hk/llama
Inference code for LLaMA models
arman-hk/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
arman-hk/Personalize-SAM
Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds
arman-hk/Sophia
The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
arman-hk/watermarking-sum
Applying a watermarking algorithm to a pre-trained summarization model (BART) and a summarization dataset (CNN)