The Data Knowledge Hub is an initiative designed to serve as a central resource for those interested in monitoring social media and online discourse. It is aimed at researchers, social scientists, data scientists, journalists, practitioners, and policymakers.
Disinformation is becoming increasingly dynamic and widespread in the digital space, especially on major social platforms. Algorithms and platform design features often amplify disinformation, hate, and incitement. This project seeks to address these challenges by enabling more effective monitoring of social media.
The project reduces barriers to social media monitoring by aggregating essential information, legal frameworks, tools, and code samples through crowdsourcing. By doing so, it aims to support technically capable individuals in conducting thorough and contextual social media analysis.
- Knowledge Base: A comprehensive repository of resources related to social media monitoring.
- Tools and Code Samples: Curated tools and code snippets to facilitate research and analysis.
- Legal Information: Guidance on the legal aspects of social media monitoring.
The project is built using docusaurus. To get a local copy up and running, follow these simple steps.
- Node.js
- Yarn
- Clone the repository:
git clone https://github.com/bertelsmannstift/data-knowledge-hub
- Navigate to the documentation directory:
cd data-knowledge-hub/docs
- Install dependencies:
yarn install
To start the development server:
yarn start
Open http://localhost:3000 to view it in the browser.
To build the project for production:
yarn build
The Data Knowledge Hub also includes a collection of projects that demonstrate the use of social media monitoring tools and techniques. To get started with the projects, follow these steps:
- Clone the repository:
git clone https://github.com/bertelsmannstift/data-knowledge-hub
- Navigate to the project directory:
cd data-knowledge-hub/projects
The following projects are available in the repository:
Project | Description | Language | Plattform | Code |
---|---|---|---|---|
tiktok-scraping |
Collect data on TikTok using puppeteer | JavaScript | TikTok | Code |
tiktok-hashtag-analysis |
Analyse TikTok Hashtags | Python | TikTok | Code |
blog-webscraping |
Webscraping using rvest and selenium | R | Blogs | Code |
twitter-streaming |
Large-scale data collection on Twitter | Python | Twitter / X | Repository |
twitter-social-network |
Social Network Analysis with R | R | Twitter / X | Code |
We welcome contributions! Please see our contributing guide for more details.
This project is licensed under the MIT License. See the LICENSE file for details. The content is licensed under the Creative Commons Attribution-ShareAlike 4.0 International License.
This project is supported by many experts and contributors. We would like to thank all of them for their valuable input and feedback.
For more information, visit Data Knowledge Hub.