dimitryslavin
interested in applying modern techniques in natural language processing + predictive analytics to improve human decision making
San Francisco, CA
dimitryslavin's Stars
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
openai/openai-cookbook
Examples and guides for using the OpenAI API
taichi-dev/taichi
Productive, portable, and performant GPU programming in Python.
Textualize/textual
The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
mxrch/GHunt
🕵️♂️ Offensive Google framework.
google/or-tools
Google's Operations Research tools:
AkashSingh3031/The-Complete-FAANG-Preparation
Dive into this repository, a comprehensive resource covering Data Structures, Algorithms, 450 DSA by Love Babbar, Striver DSA sheet, Apna College DSA Sheet, and FAANG Questions! 🚀 That's not all! We've got Technical Subjects like Operating Systems, DBMS, SQL, Computer Networks, and Object-Oriented Programming, all waiting for you.
vaexio/vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
PAIR-code/facets
Visualizations for machine learning datasets
airbnb/knowledge-repo
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
christabor/flask_jsondash
:snake: :bar_chart: :chart_with_upwards_trend: Build complex dashboards without any front-end code. Use your own endpoints. JSON config only. Ready to go.
obsei/obsei
Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand image analysis, comparative study and more .
MilaNLProc/contextualized-topic-models
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).
google-research/tapas
End-to-end neural table-text understanding models.
JohnSnowLabs/spark-nlp-workshop
Public runnable examples of using John Snow Labs' NLP for Apache Spark.
obsidianforensics/unfurl
Extract and Visualize Data from URLs using Unfurl
asdf-format/asdf
ASDF (Advanced Scientific Data Format) is a next generation interchange format for scientific data
streamlet-dev/tributary
Streaming reactive and dataflow graphs in Python
phillipdupuis/dtale-desktop
Build a data visualization dashboard with simple snippets of python code
saharmor/realtime-transcription-playground
A real-time transcription project using React and socketio
sarl/sarl
SARL Agent-Oriented Programming Language http://www.sarl.io
jupyterlab-contrib/jupyterlab-unfold
An IDE-like file browser for JupyterLab
slgero/receipt_parser
Allow parsing Russian receipts
briancaffey/sec-filings-app
This repo contains code for a web application that allows users to view SEC Filing data.
gnaneshwar441/Business_Duration
Calculates business duration in days, hours, minutes and seconds by excluding weekends, public holidays and non-business hours
rflynn/regroup
Generate a regular expression that describes a set of strings.
veryfi/veryfi-python
Python module for communicating with the Veryfi OCR API.
air-yan/InvoiceOCR
This project aims to automate the receipt/invoice parsing process.
cphouser/receiptapp
A graphical interface for reading item data on grocery receipts using Tesseract OCR. Each item and corresponding price is parsed from the text and displayed in the interface for correcting errors from OCR. The data from each receipt is then saved with the transaction date in a JSON file. Developed in course ”Data Wrangling and Web Scraping.”
adnanalvee/spark-assist
Helper functions for performance optimization and data cleansing