Pinned Repositories
amcharts_example
Ruby on Rails tutorial describing how to link an amCharts JavaScript chart to the data in your database
chat_correct
A Ruby gem that shows the errors and error types when a correct English sentence is diffed with an incorrect English sentence.
confidential_info_redactor
Ruby gem to semi-automatically redact confidential information from a text
confidential_info_redactor_lite
The lite version of https://github.com/diasks2/confidential_info_redactor - include your own language packs
heroku-buildpack-mecab
This is a buildpack that enables using the mecab gem on Heroku Cedar.
pragmatic_segmenter
Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.
pragmatic_tokenizer
A multilingual tokenizer to split a string into tokens
ruby-nlp
A collection of links to Ruby Natural Language Processing (NLP) libraries, tools and software
surveyor_example
extended NUBIC/surveyor example (tied to a user model)
word_count_analyzer
Word Count Analyzer is a Ruby gem that analyzes a string for potential areas of the text that might cause word count discrepancies depending on the tool used. It also provides comprehensive configuration options so you can easily customize how different gray areas should be counted and find the right word count for your purposes.
diasks2's Repositories
diasks2/ruby-nlp
A collection of links to Ruby Natural Language Processing (NLP) libraries, tools and software
diasks2/pragmatic_segmenter
Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.
diasks2/pragmatic_tokenizer
A multilingual tokenizer to split a string into tokens
diasks2/chat_correct
A Ruby gem that shows the errors and error types when a correct English sentence is diffed with an incorrect English sentence.
diasks2/word_count_analyzer
Word Count Analyzer is a Ruby gem that analyzes a string for potential areas of the text that might cause word count discrepancies depending on the tool used. It also provides comprehensive configuration options so you can easily customize how different gray areas should be counted and find the right word count for your purposes.
diasks2/confidential_info_redactor
Ruby gem to semi-automatically redact confidential information from a text
diasks2/confidential_info_redactor_lite
The lite version of https://github.com/diasks2/confidential_info_redactor - include your own language packs
diasks2/pretty_strings
Take strings that have been abused in the wild and clean them up (for translation tools)
diasks2/proz
ProZ is a Ruby wrapper for the ProZ.com API
diasks2/sdltm_importer
Import the content of a .sdltm translation memory file
diasks2/era_835_parser
Electronic Remittance Advice (ERA) 835 parser
diasks2/tbx_importer
TBX (TermBase eXchange) file importer
diasks2/tmx_importer
TMX translation memory file importer
diasks2/txt_tm_importer
Import the content of a .txt translation memory file
diasks2/xlf_importer
XLIFF / XLF file importer
diasks2/adapter
Shim to insulate apps from spec changes and prefix differences. Latest adapter.js release:
diasks2/audited
Audited (formerly acts_as_audited) is an ORM extension that logs all changes to your Rails models.
diasks2/delayed_job_web
Resque like web interface for delayed job
diasks2/diasks2.github.io
Kevin Scott Dias
diasks2/hackathon_project_a
1st Annual Ambiki Hackathon (Project A)
diasks2/hackathon_project_b
1st Annual Ambiki Hackathon (Project B)
diasks2/icalendar
icalendar.rb main repository
diasks2/omniauth-proz
ProZ OAuth2 Strategy for OmniAuth
diasks2/rails
Ruby on Rails
diasks2/sdltb_importer
Import the content of a .sdltb terminology file
diasks2/select2-bootstrap-theme
A Select2 v4 Theme for Bootstrap 3
diasks2/string_diff
A gem for comparing two strings for differences, and highlighting those differences.
diasks2/tf-idf-similarity
Ruby gem to calculate the similarity between texts using tf*idf
diasks2/txml_importer
Import the content of a .txml translation file
diasks2/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)