Write a script to download and parse the given XML feed, manipulate some of the data, and deliver a CSV of the required fields. You may use any additional libraries that you wish, please include a requirements.txt if you do.
- Contains only properties listed from 2016 [DateListed]
- Contains only properties that contain the word "and" in the Description field
- CSV ordered by DateListed
- Required fields:
- MlsId
- MlsName
- DateListed
- StreetAddress
- Price
- Bedrooms
- Bathrooms
- Appliances (all sub-nodes comma joined)
- Rooms (all sub-nodes comma joined)
- Description (the first 200 characters)
- Interpreter version: python 2.7
- Reasonable unit test coverage
- All libraries used must be documented in requirements.txt
- We will be using
pip install -r requirements.txt
prior to running your code
- We will be using
- Raw information to parse / feed url
- http://syndication.enterprise.websiteidx.com/feeds/BoojCodeTest.xml
- This feed must be downloaded from with in the script, raw data must not be downloaded manually
- Work should be tracked with Git
- Submit final product by submitting a pull request
A lot of the work in our department is parsing and manipulating data from a variety of sources. The given example is one of our XML files that we send to Zillow for property syndication. Our goal in this test is to see how you will approach the processing of this feed. Your solution should take into the account that there will be other XML feeds that need parsing as well, so how modular/reusable you make the code is very important.
This assignment is expect to take a few hours. We ask that you do not spend too much time on this solution. If you are stuck or have questions, feel free to reach out and we will answer quickly.