textprocessing

There are 46 repositories under textprocessing topic.

  • apache/opennlp

    Apache OpenNLP

    Language:Java1.6k880485
  • psemiletov/tea-qt

    TEA text editor

    Language:C++9354413
  • apache/opennlp-sandbox

    Apache OpenNLP Sandbox

    Language:Java4416034
  • tupilabs/HumanNameParser.java

    HumanNameParser.java, a Java port of HumanNameParser.php. Parser for human names in Java, all credit goes to @jasonpriem

    Language:Java276924
  • YeetWords

    verachell/YeetWords

    a domain-specific language for text substitution

    Language:Ruby13120
  • apache/opennlp-models

    Apache OpenNLP Models

    Language:Shell91202
  • cat_win

    SilenZcience/cat_win

    Advanced OS Independent Text-Processing and -Analytics Command Line Tool made in Python.

    Language:Python9210
  • kashishsinghyadav/VoiceVista-voice-to-sign-converter

    An application which takes in live speech or audio recording as input, converts it into text and displays the relevant Indian Sign Language images or GIFs.

    Language:Python8101
  • absmj/texcel

    Texel is a converting tool between text and colors. All symbols define hexadecimal values, convert to RGB values and set pixels.

    Language:JavaScript3100
  • CodeRTX/Google-Generative-AI-Colab

    A comprehensive guide to using Google Generative AI in Colab, covering API configuration, model initialization, chatting, video processing, story generation, and image processing. ๐Ÿค–๐ŸŽฌ๐Ÿ–ผ๏ธ

    Language:Jupyter Notebook31
  • Doktorierer/WordAsLatexEditor

    MS Word as a LaTeX editor/previewer

  • J-TECH-bot/Blackcoffer_Data_Extraction_NLP

    This repository showcases data-driven text analytics using NLP techniques. It combines text preprocessing, sentiment scoring, and structured data extraction to convert unstructured text into business-ready datasets.

    Language:Jupyter Notebook3100
  • jyu0414/linelog2py

    LINE Chat History Parser for Python

    Language:Python3100
  • EdenOved/SMSTranslation

    Developed an Android application that translates SMS messages into the user's preferred language, offering both on-demand and automatic translation. Integrated machine learning models for offline translation, enabling users to understand SMS notifications without internet access.

    Language:Java2100
  • HimalayaMinds/string-mastery-toolkit

    A Comprehensive Toolbox for Mastery in String Operations Across Programming Paradigms ๐Ÿš€๐Ÿ”

  • jkecb/pdf-to-epub-ai

    Python scripts pipeline for converting PDF files to clean, readable text format using openai api for post-OCR correction and text refinement.

    Language:Python2
  • ManuPer3z/Sentiment-Analysis-Project

    A project that harnesses the Stanford NLP library to gauge sentiment from provided text via an intuitive graphical interface.

    Language:Java2111
  • Vikhram-S/IndianConstitution

    A Python library for exploring the Constitution of India.

    Language:Jupyter Notebook211
  • Afnanksalal/TextWave

    Empower your projects with high-quality, natural-sounding voices, crafted for diverse applications from interactive games to enhanced accessibility.

    Language:Python1100
  • ao9000/NTU-SC1015-Data-Science-Mini-Project

    Mini project for NTU-SC1015 (Introduction to Data Science and Artificial Intelligence). Regarding fake news analysis & classification

    Language:Jupyter Notebook1101
  • arya-io/LingoLens

    A Streamlit app for language translation and sentiment analysis powered by GoogleTranslator and TextBlob.

    Language:Python1100
  • arya-io/NER-EntityLinker

    A Streamlit app that performs Named Entity Recognition (NER), links entities to Wikipedia, and handles disambiguation for ambiguous terms like "Apple," using NLP techniques.

    Language:Python1100
  • Chihuah/Process-Long-Subtitles

    ไธ€ๅ€‹ Python ๅทฅๅ…ท๏ผŒ็”จไพ†ๅฐ‡ AI ็”ข็”Ÿ็š„้Ž้•ท SRT ๅญ—ๅน•ๆ‹†ๅˆ†ๆˆๅคš่กŒ๏ผŒไธฆ่‡ชๅ‹•ๅ‡ๅˆ†ๆ™‚้–“่ปŒ๏ผŒๆๅ‡ๅญ—ๅน•ๅฏ่ฎ€ๆ€งใ€‚

    Language:Jupyter Notebook10
  • its-arnavtech/Parser_Build-Arnav

    This Project is currently working on extracting key data from a resume in order to enhance a candidate's profile

    Language:Python1
  • mantzaris/KeemenaPreprocessing.jl

    Preprocessing for text data: cleaning, normalization, vectorization, tokenization and more

    Language:Julia1
  • pauldolphin3/Acs.Core

    Easy-to-use library to work with files and processes

    Language:C#10
  • Unc1eCat/spotify-lyrics-server-substitute

    This application fixes the issue of missing lyrics on Spotify. It fetches them from other lyrics providers rather than the ones Spotify is in partnership with.

    Language:Python1100
  • arya-io/email-generator

    A Cold Email Generator that uses Groq Cloud, ChromaDB, Langchain, and Streamlit to automatically create personalized cold emails for job opportunities by extracting details from job listings.

    Language:Python0100
  • arya-io/NLP-Explorer

    NLP Explorer is an interactive Streamlit app that lets users explore various NLP techniques like Tokenization, POS Tagging, Stemming, Lemmatization, and NER. It provides real-time analysis of text, making it a great tool for learning and experimenting with NLP concepts.

    Language:Python0100
  • Hemanthsp999/Learners_Mate

    Introducing an innovative local retrieval-augmented generation tool designed to streamline and enhance learning experiences for students. This advanced system provides students with quick access to relevant information and personalized learning support, making studying more efficient and effective.

    Language:Python00
  • Tech-Guyy/Text-processing-Information-retrieval-and-information-extraction-text-to-speech

    The author implemented simple rule base solution and machine learning approach for information retrieval and information extraction after which the result were analyzed.

    Language:Jupyter Notebook0100
  • Tech-Guyy/Text-processing-sentiments-analysis

    The author implemented support vector machine for sentiments analysis and applied two feature extractions, Bag-of-Words (CountVectorizer) and TF-IDF (TfidfVectorizer), after which the results for both methods were analysed. The accuracy obtained for both methods were (BoW = 87%) and (TF-IDF = 86%).

    Language:Jupyter Notebook0100
  • bhadrakali79/NLP-Practicals

    This repository contains practical implementations of NLP concepts including dependency grammar, text processing, normalization, and TF-IDF models to demonstrate key techniques in natural language processing.

    Language:Jupyter Notebook
  • gopireddy99/Daily_AD_NLP_assignments

    AD Training classes in NLP - Daily Assignments

    Language:Jupyter Notebook10
  • suraj5424/Text-Clustering-and-Visualization

    This project processes and clusters text data using BERT embeddings, K-means, and dimensionality reduction. Visualizations include t-SNE plots and word clouds. Dataset and embeddings links are provided.

    Language:Jupyter Notebook10
  • VladAndreiToma/LeetCode_No68_TextJustification

    This is an explicative solution to LeetCode's problem No68. I provide detail explanations along the source code. Follow for more updates. Beat 100% solutions out there

    Language:Java10