/2019-zhejiang-universities-admission-info-scraper

爬取2019浙江省大学三位一体招生信息,并写入Excel表格。

Primary LanguageJupyter Notebook

三位一体招生信息爬虫

三位一体招生信息爬虫用于爬取**教育在线中的浙江高校三位一体报考简章中的信息,并写入Excel 表,以供学生及生涯规划咨询师查阅。

Version

  • 0.0.0

    • ADD: Method get_html
    • ADD: Method get_homepage_html
    • ADD: Method parse_homepage
    • ADD: Method write_homepage_form_to_excel
  • 0.0.1

    • CHANGE: The format of docstrings
    • CHANGE: Some variable and arguments names
    • ADD: A bunch of printings lines for debugging and tracing the running of the program
  • v0.0.2

    • CHANGE: Delete some useless files
  • v1.0.0

    • Fixed: Exclude **美术学院 to write to the excel
    • ADD: Method get_link(school_name)
    • ADD: Method get_admission_guide(link)
    • ADD: Method parse_admission_guide(html)
    • ADD: Method write_admission_guide_to_cvs()
  • v1.1.0

    • CHANGE: The name of the file "三位一体招生信息爬虫.py" to "admission_info_crawler"
    • Add: File test_admission_info_crawler.py
  • v1.2.0

    • CHANGE: Method get_admission_guide
    • ADD: Method get_one_page_admission_guide

Contact

Fan Zhang