Pinned Repositories
AnkiTools4j
Anki deck creation in Java
deep_learning_school
tasks and projects from the deep learning school by MIPT
distiller
knowledge distillation for BERT (classification and token classification models)
hebrew_summarizer
fine-tuning experiments on summarization tasks for Hebrew
huawei-nlpcourse-project
Topic modeling and classification of news in Hebrew with a Neural Text Summarizer model
jupyter-notebook-viewer
Chrome extension for viewing Jupyter Notebooks in the browser without a Jupyter server
wav2vec2-hebrew
Speech Recognition for Hebrew (using wav2vec2 models)
yandex-practicum
tasks and projects from the data science course by Yandex.Practicum
imvladikon's Repositories
imvladikon/jupyter-notebook-viewer
Chrome extension for viewing Jupyter Notebooks in the browser without a Jupyter server
imvladikon/AnkiTools4j
Anki deck creation in Java
imvladikon/wav2vec2-hebrew
Speech Recognition for Hebrew (using wav2vec2 models)
imvladikon/deep_learning_school
tasks and projects from the deep learning school by MIPT
imvladikon/hebrew_summarizer
fine-tuning experiments on summarization tasks for Hebrew
imvladikon/bm25_vectorizer
sklearn-compatible BM25 vectorizers
imvladikon/distiller
knowledge distillation for BERT (classification and token classification models)
imvladikon/news_scrapers
This repository contains scripts for scraping news from different sources
imvladikon/weak_annotators
imvladikon/imvladikon
imvladikon/abydos
Abydos NLP/IR library for Python; [imvladikon] made some changes
imvladikon/annotations_deduplications
scripts to deduplicate annotations, refine NER spans, and analyze the differences
imvladikon/biorxiv_scraper
imvladikon/blt
Code for BLT research paper
imvladikon/campus-dl
A simple tool to download video lectures from campus.gov.il (based on edx-dl)
imvladikon/cdatasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble; [imvladikon] added Cython implementations
imvladikon/deduplicator
Simple entity deduplication package
imvladikon/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
imvladikon/evaluate
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
imvladikon/html_extractor
imvladikon/indonesian_nlp_experiments
some experiments in Indonesian NLP (information extraction from court reports)
imvladikon/pysubs3
A Python library for editing subtitle files (fork of pysubs2 with changes)
imvladikon/scraper-cars
imvladikon/seqeval
A Python framework for sequence labeling evaluation (named-entity recognition, POS tagging, etc.)
imvladikon/spacy-trankit
💥 Trankit models directly in spaCy💥
imvladikon/string-embed
😆 string embedding for fast edit distance computation; code for [Convolutional Embedding for Edit Distance (SIGIR 20)].
imvladikon/telegram-bot-hebrew
Telegram bot (Spring Boot, Java) with some language services for Hebrew (translation, inflection)
imvladikon/trie_hard_py
imvladikon/wikitalk_parser
Fetching and parsing Wikipedia Talks
imvladikon/ydata
YDATA school assignments