news-crawler
There are 33 repositories under the news-crawler topic.
adbar/trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
fhamborg/news-please
news-please - an integrated web crawler and information extractor for news that just works
flairNLP/fundus
A very simple news crawler with a funny name
lewisdonovan/google-news-scraper
Lightweight scraper for Google News
lumyjuwon/KoreaNewsCrawler
A Korean news crawler built to ingest large amounts of news data.
LuChang-CS/news-crawler
A news crawler for BBC News, Reuters and New York Times.
johnbumgarner/newshound
This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around the world in over 50 languages.
stardust95/NewsFeeds
News feed website using Node.js as the server and MongoDB as the storage backend, including a simple recommendation system. A Node.js-based news aggregation website that supports recommending news based on user behavior.
atulyakumar97/news-sentiment-analysis
The spider crawls moneycontrol.com and economictimes.com to fetch news about the input companies, then scores and classifies the companies to raise an early warning signal.
nploi/news_crawler
News crawler is a tool that helps you crawl data from a news site.
SecondDim/crawler-news
A crawler built with Python and Scrapy for real-time Taiwanese news websites.
divkakwani/webcorpus
Generate large textual corpora for almost any language by crawling the web
fingeredman/teanaps-web-scraper
Provides web scraping tools for collecting data for text analysis.
MoritzGoeckel/NodeJsNewsCrawler
📰 Search engine for news in Node.js
AndyTheFactory/article-extraction-dataset
Article title, authors, date and body extraction dataset.
sakshamssr/GNews-API
A fast and lightweight Python API that searches for articles on Google News and returns a JSON response.
andy-clarke-uofg/Pundits-Review
11/09/2020 - Complete directory for Pundits Review web application. https://www.punditsreview.com/
santhoshse7en/Alcoholics-Anonymous
Research project to analyse public knowledge about Alcoholics Anonymous
siristechnology/news-crawler
Config-based news crawler using Google Puppeteer
V3RNE42/NEWS_CRAWLER
📰 NEWS_CRAWLER: Automate Your News Updates! 📰 A NodeJS web crawler that generates personalized newsletters using Resend and OpenAI APIs. Ideal for staying on top of web trends and automating your news feed.
arian-askari/persian_news_websites_crawler
Crawler (scraper) for collecting public data from several well-known Persian news websites
aufamiri/berita-crawler
A web crawler that collects the latest Indonesian news from many sources
karimhabush/TheguardianScrapper
A Scrapy web scraper that scrapes and stores articles from theguardian.com
luhuadong/newscraper
🐞 A general news information crawler.
thinh-vu/vnnews
A Python package that helps capture news updates from top Vietnamese news sites
sunight1999/news-crawler
Naver and Daum news web crawler via JSoup + Selenium.
tunahanoguz/news-crawler
News crawler project written in Python.
BelisAliosmanova/NewsCrawler
News crawler
guillempuche/news_crawler
Scrape news from Olot town hall (https://www.olot.cat) with TypeScript and Crawlee. Collects summaries and full articles, stored in separate datasets.