/find_job_titles

find any kind of occupation or job title in a text or file

Primary LanguagePythonMIT LicenseMIT

find_job_titles

Coverage Status

Find Job Titles in Strings

  • Free software: MIT license
  • Python versions: 2.7, 3.4+

Features

  • Find any of 77k job titles in a given string
  • Text processing is extremely fast using "acora" library
  • Dictionary generation takes about 20 seconds upfront

Quickstart

Instantiate "Finder" and start extracting job titles:

>>> from find_job_titles import FinderAcora
>>> finder=FinderAcora()
>>> finder.findall(u'I am the Senior Vice President')
[('Senior Vice President', 9),
 ('Vice President', 16),
 ('President', 21)]

All possible, overlapping matches are returned. Matches contain positional information of where the match was found.

Alternatively use "finditer" for lazy consumption of matches:

>>> finder.finditer('I am the Senior Vice President')]
<generator object ...>

Credits

This package was created with Cookiecutter and the fluquid/cookiecutter-pypackage project template.