shamanez
Founding Applied NLP & Research Team Lead @ Arcee.ai | Ph.D. in NLP
@arcee-aiAuckland New Zealand
Pinned Repositories
DAM
mergekit
Tools for merging pretrained large language models.
BERT-like-is-All-You-Need
The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like'" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
GAIL-with-WGAN-loss-for-the-Discriminator
This is about imitation learning using PPO and WGAN-GP loss. This is heavily influenced by GAIL-PPO repository in following link - https://github.com/uidilr/gail_ppo_tf. My agent will get converged to perform his task around 3384 iterations.
IMU-PLOS_LSTM
Using LSTM networks to train IMU data by PLOS - This is custom LSTM-RNN .
Self-Supervised-Embedding-Fusion-Transformer
The code for our IEEE ACCESS (2020) paper Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion.
SummarizeMe-Digital-Journal
Weakly-supervised BART-based autobiographical text summarization model.
Target-Driven-Visual-Navigation-with-Distributed-PPO
This repository has used AI2THOR CVPR data set.
Variational-Discriminator-Bottleneck-Tensorflow-Implementation
Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow - Tensorlfow Implementation
VUSFA-Variational-Universal-Successor-Features-Approximator
This repository contains implementations of the paper VUSFA
shamanez's Repositories
shamanez/Self-Supervised-Embedding-Fusion-Transformer
The code for our IEEE ACCESS (2020) paper Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion.
shamanez/VUSFA-Variational-Universal-Successor-Features-Approximator
This repository contains implementations of the paper VUSFA
shamanez/Target-Driven-Visual-Navigation-A3C-USF-LSTM
Added the LSTM node prior to the policy and the USF apprximation
shamanez/SummarizeMe-Digital-Journal
Weakly-supervised BART-based autobiographical text summarization model.
shamanez/llm-autoeval
Automatically evaluate your LLMs in Google Colab
shamanez/sementic-search-with-PEFT
Semantic Search with PEFT and Transformers
shamanez/LLM-Continual-Learning-Papers
Must-read Papers on Large Language Model (LLM) Continual Learning
shamanez/alignment-handbook
Robust recipes to align language models with human and AI preferences
shamanez/autogen-upstream
A programming framework for agentic AI 🤖
shamanez/Awesome-LLM-Compression
Awesome LLM compression research papers and tools.
shamanez/Awesome-LLMs-Pruning
Awesome LLM pruning papers all-in-one repository with integrating all useful resources and insights.
shamanez/awsome-distributed-training
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
shamanez/BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
shamanez/checkpoint-upload
This is to upload-small-checkpoints
shamanez/databricks-test
This is to check the file import from github to databricks
shamanez/e17-4yp-Large-Language-Models-in-Education
The project targets to explore the use of Large Language models in education and develop an intelligent tutor.
shamanez/examples
repository of example scripts, notebooks, projects
shamanez/Megatron-LM
Ongoing research training transformer models at scale
shamanez/mlops-zoomcamp
Free MLOps course from DataTalks.Club
shamanez/multiprocessing-tutorials
shamanez/optillm
Optimizing inference proxy for LLMs
shamanez/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
shamanez/portfolio
This repository contains the details about my project and things happening in my life.
shamanez/RAG-end2end-datasets
Datasets used in RAG-end2end repositories.
shamanez/ReAtt
Retrieval as Attention
shamanez/ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search
shamanez/ROUGE-score-stat
this is a repo that calculate the statistical significant between ROUGE scores
shamanez/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
shamanez/VQ-Rec
[WWW'23] PyTorch implementation for "Learning Vector-Quantized Item Representation for Transferable Sequential Recommenders".
shamanez/VR-comfort
Contains the data cleaning and model development code.