Ating pag-ibayuhin ang ating talahuluganan!
Collects Tagalog words from tagalog.pinoydictionary.com, a database of Tagalog words powered by Cyberspace.ph Web Hosting using web scraping and web crawling techniques.
24,868 words (as of Oct 20, 2016)
Each webpage is loaded and parsed, extracting the words enclosed in <dt>
tag.
Included is tagalog.pinoydictionary.com
html
snippet containing the source of
http://tagalog.pinoydictionary.com/list/a/
to serve as guide and overview on how dictionary words from the page are extracted.
Disclaimer:
I do not own the html
code cited above, it is owned by tagalog.pinoydictionary.com.
Originally it is intended for a Scrabble ® Tagalog dictionary database, but other uses may vary.
python -m pip install -U pip beautifulsoup4
tagalog_dict.txt
is where the scrapercollect_tagalog.py
puts the collected words.- The output file
tagalog_dict.txt
will be updated from time to time to ensure up-to-date collection. 📅