This notes is from my lecturer. I just convert it into md.
Below is the process involve during do Big Data Project
- Performance
- Scalability
- Vertical - Need to add more RAM/resources to a single machine
- Horizontal - Add more cluster/machines
- Maintainability
- Availability
- Security
- How to acquire/export data?
- How do you store large data?
- How do you retrieve large data?
- How do you process large data?
- How do you sorting large data?
- How do you analyze large data? Structured? Semi unstructured? Unstructured?