decoder-model
There are 18 repositories under the decoder-model topic.
shivendrra/SmallLanguageModel
An LLM cookbook for building your own model from scratch, all the way from gathering data to training a model
logic-OT/Decoder-Only-LLM
This repository features a custom-built decoder-only language model (LLM) with a total of 37 million parameters 🔥. The model is trained to ask questions from a given context
partarstu/transformers-in-java
Experimental project for AI and NLP based on the Transformer architecture
aiden200/GPT3_Implementation
Implementation of the GPT-3 paper: Language Models are Few-Shot Learners
LaurentVeyssier/Image-Captioning-Project-with-full-Encoder-Decoder-model
Generate captions for images using a CNN encoder-LSTM decoder structure
SharathHebbar/Transformers
Transformers Intuition
shivendrra/enigma
DNA sequence generation and classification using transformers
DaniyalAhmedKhan1234/Academic-Text-Simplification
This project aims to simplify texts from research papers using advanced natural language processing (NLP) techniques, making them more accessible to a broader audience
edwinthomas444/cheese_advertisement_generator
An LLM-based tool for generating cheese advertisements
Muhammad-Ibrahim-Khan/minigpt
A miniGPT inspired by the original nanoGPT released by Andrej Karpathy. This is a notebook that walks through the decoder part of the transformer architecture in detail.
ahmedelsayed968/Arabic-Text-Summarizer
Build a text summarizer for the Arabic language
deniz-askin/Decoder-Based-Semantic-Parser
A decoder-based semantic parser that can be tested on four benchmark datasets (ATIS, GeoQuery, Jobs640, and Django)
dipankarsrirag/lordd
Code and dataset used to train dialect adapters for decoder models.
hardaatbaath/multimodal_vision_model
A multimodal vision model that takes in an image and a prompt query, and outputs the answer
SLotAbr/Decoder_model
Decoder model for language modelling
gowtamyreddy/NLP
Text Generation using RNN, LSTM, and Transformer
KempnerInstitute/minOLMo
An explainable and simplified version of the OLMo model