Pinned Repositories
ajna
Ajna Data Science - Web Tool for Exploratory Data Analysis
aleph
an open source malware handling system
Alitheia-Core
A platform for software engineering research
annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
anomaly-detection
A simple demonstration of sub-sequence sampling as used for anomaly detection with EKG signals
antlr-url-grammar
ANTLR URL Grammar
APM_Exercises
Exercises for the book Applied Predictive Modeling by Kuhn and Johnson (2013)
datasharing
The Leek group guide to data sharing
Security-Data-Analysis-with-R
A series of labs that will help users apply various data science techniques to security related data. Based on Mike Sconzo and David Dorsey work with ML in Python.
urlscan.io-R
Basic R functions to submit and recover results from URLScan.io through API
ekamioka's Repositories
ekamioka/awesome-datascience
:memo: An awesome Data Science repository to learn and apply for real world problems.
ekamioka/awesome-healthcare
Curated list of awesome open source healthcare software, libraries, tools and resources.
ekamioka/BEPb
Config files for my GitHub profile.
ekamioka/CAAFE
CAAFE lets you semi-automate your feature engineering process based on your explanations on the dataset and with the help of language models. It is based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).
ekamioka/data-science-from-scratch
code for Data Science From Scratch book
ekamioka/diagrams
:art: Diagram as Code for prototyping cloud system architectures
ekamioka/DidierStevensSuite
Please no pull requests for this repository. Thanks!
ekamioka/doccano
Open source text annotation tool for machine learning practitioner.
ekamioka/DumpsterDiver
Tool to search secrets in various filetypes.
ekamioka/gRPC-NoTF-Client
ekamioka/HowToBeADataScientistImpostor-book
Chapters, code, and organizational materials for the book "How to be a data scientist impostor?"
ekamioka/langchain
🦜🔗 Build context-aware reasoning applications
ekamioka/lazypredict
Lazy Predict help build a lot of basic models without much code and helps understand which models works better without any parameter tuning
ekamioka/misp-warninglists
Warning lists to inform users of MISP about potential false-positives or other information in indicators
ekamioka/natlas
Scaling Network Scanning. Changes prior to 1.0 may cause difficult to avoid backwards incompatibilities. You've been warned.
ekamioka/oletools
oletools - python tools to analyze MS OLE2 files (Structured Storage, Compound File Binary Format) and MS Office documents, for malware analysis, forensics and debugging.
ekamioka/pandas-ai
Pandas AI is a Python library that integrates generative artificial intelligence capabilities into Pandas, making dataframes conversational
ekamioka/pyinstxtractor
PyInstaller Extractor
ekamioka/rsty-stack-example
ekamioka/santa
A binary authorization system for macOS
ekamioka/scikit-learn-lambda
Toolkit for deploying scikit-learn models for realtime inference on AWS Lambda
ekamioka/shapash
🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models
ekamioka/SourceCodeVisualizer
Visualize how a projects source code is distributed among its files and folders
ekamioka/static-files
A collection of static files maintained by the Sublime team, primarily used for phishing defense.
ekamioka/tabnet
PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf
ekamioka/tfserving-python-predict-client
Client used to send grcp requests to a tfserving model
ekamioka/URL-Classification
Machine learning to classify Malicious (Spam)/Benign URL's
ekamioka/utils.py
ekamioka/webpage
ekamioka/windows_sdk_data
Windows API listing in JSON format - generated from SDK headers + SDK API documentation