Pinned Repositories
DatalakeProject
Datalake is a database management system that stores files of different formats and supports search across their fields and links. It was built with DynamoDB, MongoDB, and Node.js.
Fault-tolerance-Chat-System
A real-time chat system with replicated chat servers and multiple clients. It lets a group of users of arbitrary size send and receive real-time messages within the group.
MapReduce-Framework
A MapReduce framework based on Storm that is flexible enough for any MapReduce job. It runs with a number of workers and a single master, and uses BerkeleyDB as temporary data storage when processing large data sets.
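To make the map, shuffle, and reduce phases concrete, here is a minimal single-process sketch in Python. It is not the repository's code: the names `map_reduce`, `word_count_mapper`, and `word_count_reducer` are hypothetical, an in-memory dictionary stands in for the BerkeleyDB temporary storage, and there is no master/worker distribution.

```python
from collections import defaultdict


def map_reduce(records, mapper, reducer):
    """Run one map -> shuffle -> reduce pass in a single process."""
    # Map + shuffle: group every emitted value under its key.
    # (Assumption: an in-memory dict replaces the on-disk temporary store.)
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)
    # Reduce: collapse each key's list of values into one result.
    return {key: reducer(key, values) for key, values in groups.items()}


def word_count_mapper(line):
    """Emit (word, 1) for every whitespace-separated word in a line."""
    for word in line.lower().split():
        yield word, 1


def word_count_reducer(word, counts):
    """Sum the partial counts collected for one word."""
    return sum(counts)


counts = map_reduce(["a b a", "b c"], word_count_mapper, word_count_reducer)
```

Any job that can be written as a mapper emitting key/value pairs and a reducer folding the values for one key fits the same `map_reduce` call.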
Mini-Google-Drive
A distributed data storage platform implemented with a GFS-style architecture. Nodes communicate over gRPC, and the platform supports reliable webmail and storage services. It includes a highly scalable and consistent BigTable-style system written in C++.
Multithread-Http-Server
An HTTP application server that implements the Java Servlet 2.4 specification. It is an HTTP server (servlet container) built on top of my own thread pool and blocking queue, which dispatches incoming requests to the corresponding servlets.
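The thread pool and blocking queue pattern can be sketched as follows. This is an illustrative Python version, not the repository's Java code: `ThreadPool`, `hello_servlet`, and the `max_pending` parameter are hypothetical names, and the standard library's `queue.Queue` stands in for a hand-written blocking queue.

```python
import queue
import threading


class ThreadPool:
    """Fixed set of workers that pull requests from a bounded blocking queue."""

    def __init__(self, num_workers, max_pending=16):
        # put() blocks when the queue is full; get() blocks when it is empty.
        self._tasks = queue.Queue(maxsize=max_pending)
        self._workers = [
            threading.Thread(target=self._run, daemon=True)
            for _ in range(num_workers)
        ]
        for worker in self._workers:
            worker.start()

    def _run(self):
        while True:
            task = self._tasks.get()
            if task is None:  # sentinel: stop this worker
                self._tasks.task_done()
                break
            handler, request, results = task
            try:
                results.append(handler(request))
            finally:
                self._tasks.task_done()

    def submit(self, handler, request, results):
        """Queue one request for the given handler."""
        self._tasks.put((handler, request, results))

    def shutdown(self):
        """Wait for queued requests, then stop every worker."""
        self._tasks.join()
        for _ in self._workers:
            self._tasks.put(None)
        for worker in self._workers:
            worker.join()


def hello_servlet(path):
    """Hypothetical handler standing in for a servlet."""
    return f"200 OK {path}"


responses = []
pool = ThreadPool(num_workers=4)
for path in ["/a", "/b", "/c"]:
    pool.submit(hello_servlet, path, responses)
pool.shutdown()
```

Responses arrive in whatever order the workers finish, so callers should not rely on `responses` matching submission order.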
Web-Crawler-and-Xpath-Engine
A topic-specific crawler that looks for documents or data matching a particular category, specified as an XPath expression. The crawler traverses the Web, looking for HTML and XML documents that match one of the XPath expressions.
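The matching step can be illustrated with a small Python sketch. It is not the repository's engine: `matches_any` is a hypothetical helper, it relies on the limited XPath subset in Python's `xml.etree.ElementTree`, paths are written relative to the root element, and only well-formed XML is handled (HTML would need a separate parser).

```python
import xml.etree.ElementTree as ET


def matches_any(document, xpaths):
    """Return True if the document matches at least one XPath expression."""
    try:
        root = ET.fromstring(document)
    except ET.ParseError:
        # Assumption: documents that are not well-formed XML never match.
        return False
    # find() returns the first matching element, or None when nothing matches.
    return any(root.find(xpath) is not None for xpath in xpaths)


feed = "<rss><channel><item><title>News</title></item></channel></rss>"
is_match = matches_any(feed, ["./feed/entry", "./channel/item/title"])
is_miss = matches_any(feed, ["./feed/entry"])
```

A crawler would call a check like this on each fetched document and keep only those for which it returns `True`.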
Web-Search-Engine
A Google-style search engine built on over 1,000,000 web pages. It consists of a distributed crawler, indexer, ranker, and user interface.
xuyimeng's Repositories
xuyimeng/Mini-Google-Drive
xuyimeng/MapReduce-Framework
xuyimeng/DatalakeProject
xuyimeng/Fault-tolerance-Chat-System
xuyimeng/Multithread-Http-Server
xuyimeng/Web-Crawler-and-Xpath-Engine
xuyimeng/Web-Search-Engine