iamjoona

Helsinki, Finland

iamjoona's Stars

deezer/spleeter
Deezer source separation library including pretrained models.
Language: Python · 25.9k · 2.8k
google-research/bert
TensorFlow code and pre-trained models for BERT
Language: Python · 38.1k · 9.6k
jamesaphoenix/Click_Through_Rate_Optimization_Google_Search_Console
This is a small project where I created a simple machine learning model to predict the click-through rate of a given URL (web page) using Python and scikit-learn.
Language:Jupyter Notebook51
codelucas/newspaper
newspaper3k is a news, full-text, and article metadata extraction library in Python 3.
Language: Python · 14.1k · 2.1k
findopendata/findopendata
A search engine for Open Data
Language:Python536
commoncrawl/cc-pyspark
Process Common Crawl data with Python and Spark
Language:Python40586
jroakes/screaming-frog-shingling
Uses Screaming Frog Internal HTML with text extraction along with a shingling algorithm to compare content duplication across the pages of a crawled site.
Language:Python4213
searchVIU/Labs
searchVIU Labs
Language:Jupyter Notebook3530
benjaminestes/bq-stat
Get Stat ranking data into BQ for use in Data Studio.
Language:Python2
ecoron/SerpScrap
SEO Python scraper to extract data from major search engine result pages. Extract data like URL, title, snippet, rich snippet and the type from search results for given keywords. Detect ads or make automated screenshots. You can also fetch the text content of URLs provided in search results or of your own URLs. It's useful for SEO and business-related research tasks.
Language:Python25461
pandas-dev/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Language: Python · 43.7k · 17.9k
buriy/python-readability
Fast Python port of arc90's readability tool, updated to match latest readability.js!
Language: Python · 2.7k · 348
max-mapper/art-of-node
A short introduction to node.js
Language: JavaScript · 9.8k · 854
ranksense/url-inspector-automator
URL Inspection Tool Automator
Language:Python247
MLTSEO/MLTS
Machine Learning Toolkit for SEO
Language:Jupyter Notebook13743
NimaSoroush/differencify
Differencify is a library for visual regression testing
Language:JavaScript63446
kalaspuffar/puppeteer-example
A small example of how to use Puppeteer to drive Chrome
Language:JavaScript6
sohamkamani/javascript-design-patterns-for-humans
An ultra-simplified explanation of design patterns implemented in javascript
4.4k · 492
iihnordic/screamingfrog-docker
Docker image for ScreamingFrog version 16
Language:Dockerfile2918
browserless/browserless
Deploy headless browsers in Docker. Run on our cloud or bring your own. Free for non-commercial uses.
Language: TypeScript · 8.7k · 710
N0taN3rd/Squidwarc
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Language:JavaScript16826
GoogleChromeLabs/perftools-runner
Google Performance Tools runner using Puppeteer
Language:JavaScript9511
emadehsan/thal
Getting started with Puppeteer and Chrome Headless for Web Scraping
Language: JavaScript · 2.4k · 206
anishkny/webgif
Easily generate animated GIFs from websites
Language:JavaScript10515
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
Language: TypeScript · 15.4k · 664
greglobinski/gatsby-starter-hero-blog
A ready to use, easy to customize, fully equipped GatsbyJS starter with a 'Hero' section on the home page.
Language:JavaScript514207
paulirish/pwmetrics
Progressive web metrics at your fingertipz
Language: TypeScript · 1.2k · 74
jeremiak/jekyll-offline
jekyll plugin to use service workers and make site content available offline
Language:JavaScript608
phantombuster/nickjs
Web scraping library made by the Phantombuster team. Modern, simple & works on all websites. (Deprecated)
Language:JavaScript50148
