piperino11

https://huggingface.co/sag-uniroma2

Pinned Repositories

ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Language:Jupyter Notebook00
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python0 0 00
core
Production ready AI assistant framework
Language:Python0 0 00
DeepSpeech-Italian-Model
Tooling for producing Italian model (public release available) for DeepSpeech and text corpus
Language:Python0 0 00
GLiNER
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 24
Language:Python0 0 00
gqa-it
Italian Question Answering on Image Scene Graphs
00
mamba
Language:Python00
MyNN1
Language:Python0 1 00
OCRmyImage
Language:Python00
OmniFusion
OmniFusion — a multimodal model to communicate using text and images
Language:Python00

piperino11's Repositories

piperino11/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Language:Jupyter Notebook00
piperino11/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python0 0 00
piperino11/core
Production ready AI assistant framework
Language:Python0 0 00
piperino11/DeepSpeech-Italian-Model
Tooling for producing Italian model (public release available) for DeepSpeech and text corpus
Language:Python0 0 00
piperino11/GLiNER
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 24
Language:Python0 0 00
piperino11/gqa-it
Italian Question Answering on Image Scene Graphs
00
piperino11/mamba
Language:Python00
piperino11/MyNN1
Language:Python0 1 00
piperino11/OCRmyImage
Language:Python00
piperino11/OmniFusion
OmniFusion — a multimodal model to communicate using text and images
Language:Python00
piperino11/parler-tts
Inference and training library for high-quality TTS models.
Language:Python00
piperino11/squad-it
A large scale dataset for Question Answering in Italian
0 0 00
piperino11/video-caption.pytorch
Language:Python0 0 00
piperino11/skynet
AI core services for Jitsi
Language:Python0 0
piperino11/u-deppllama
Dependency parsing with Large Language Models
piperino11/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
Language:Python0 0
piperino11/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language:Jupyter Notebook0 0

piperino11

Pinned Repositories

ast

audiocraft

core

DeepSpeech-Italian-Model

GLiNER

gqa-it

mamba

MyNN1

OCRmyImage

OmniFusion

piperino11's Repositories

piperino11/ast

piperino11/audiocraft

piperino11/core

piperino11/DeepSpeech-Italian-Model

piperino11/GLiNER

piperino11/gqa-it

piperino11/mamba

piperino11/MyNN1

piperino11/OCRmyImage

piperino11/OmniFusion

piperino11/parler-tts

piperino11/squad-it

piperino11/video-caption.pytorch

piperino11/skynet

piperino11/u-deppllama

piperino11/VAR

piperino11/VoiceCraft