miradel51
Research Assistant at the Chinese Academy of Sciences (Xinjiang Branch). Former senior NLP algorithm engineer at DAMO Academy. Mainly focused on NLP, PTM, and ML.
Chinese Academy of Sciences (Xinjiang Branch)
Urumqi, Xinjiang, China
Pinned Repositories
ABigSurveyOfLLMs
A collection of 150+ surveys on LLMs
awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
codemix_ptm
optimizing cross-lingual PTM for semantic retrieval with code-mixing
miradel51.github.io
my home page
MT-Reading-List
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
preprocess
Simple script for converting a corpus to lowercase
pytorch-handbook
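pytorch handbook is an open-source book that aims to help people who want to use PyTorch for deep learning development and research get started quickly. All of the PyTorch tutorials it contains have been tested to ensure they run successfully.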
Self_Supervised_CWS
This project includes the source code and datasets for our EMNLP 2021 paper
miradel51's Repositories
miradel51/adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
miradel51/Bootstrap-Admin-Theme
A generic admin theme built with Bootstrap free for both personal and commercial use.
miradel51/clevertagger
morphologically informed POS tagging for German
miradel51/deepmat
Matlab Code for Restricted/Deep Boltzmann Machines and Autoencoders
miradel51/DL4H
Deep learning for hackers: a hands-on approach to machine learning and deep learning.
miradel51/dropout
A theano implementation of Hinton's dropout.
miradel51/earleyx
The Earleyx parser originated from Roger Levy's prefix parser, but has evolved significantly. Earleyx can generate Viterbi parses and perform rule estimation (Expectation-Maximization and Variational Bayes). The parser also implements the scaling approach described in my TACL'13 paper, which speeds up parsing and allows for parsing long sentences (with restricted grammars).
miradel51/feat2vec
Code of NAACL paper "Unsupervised Multi-Domain Adaptation with Feature Embeddings"
miradel51/giza-pp
GIZA++ is a statistical machine translation toolkit that is used to train IBM Models 1-5 and an HMM word alignment model. This package also contains the source for the mkcls tool which generates the word classes necessary for training some of the alignment models.
miradel51/hiero-decoder-py
This is a hierarchical phrase-based translation decoder which supports parallel decoding. New feature functions can also be implemented easily.
miradel51/Hownet
Client
miradel51/hownet-similarity
miradel51/HownetServer
Server-side program
miradel51/jetpack
miradel51/kaldi-lstm
C++ implementation of LSTM (Long Short-Term Memory) in Kaldi's nnet1 framework. Used for automatic speech recognition and possibly language modeling; training can be switched between CPU and GPU (CUDA). This repo has been merged into the official Kaldi codebase (Karel's setup) and is no longer maintained, so please check out the Kaldi project instead.
miradel51/mueller
MUELLER is a modular grid system for responsive/adaptive and non-responsive layouts, based on Compass.
miradel51/n-grams
My Python n-gram language model from an NLP course. Since there are so many public implementations, I feel free to post mine.
miradel51/nexus-theme
Dark custom UI theme for Sublime Text 2/3
miradel51/PythonCrawler1
A Python program that crawls Liao Xuefeng's Python tutorial
miradel51/rcnn
R-CNN: Regions with Convolutional Neural Network Features
miradel51/SemEval-PIT2015
data and scripts for the shared task "Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)" at SemEval 2015
miradel51/SRLParser
miradel51/theano_fftconv
Convolution op for Theano based on CuFFT using scikits.cuda
miradel51/theano_torch_bridge
Start of work that will allow reusing Torch code with Theano
miradel51/topics
Topic modeling with gensim and LDA
miradel51/turkish-stemmer-python
🐍 Turkish Stemmer for Python
miradel51/word2vec-win32
A word2vec port for Windows.
miradel51/word2vector
Something about the famous word2vector
miradel51/wordnet
Lexical database of any language
miradel51/zemberekMorphTR
Wrapper for zemberek morphology tool