A project about Twitter Analysis of Big data course
Our goal is to find a more efficient way to figure out Twitter uses who obtain a huge influence on others by using big data techniques.
Python3.5 Tensorflow TWINT
pip install python3
Some papers
SIGMOD
Robust, Scalable, Real-Time Event Time Series Aggregation at Twitter.
SIGKDD
Dynamic Embeddings for User Profiling in Twitter.
ICDE
Predicting Named Entity Location Using Twitter.
SIGIR
The Evolution of Content Analysis for Personalized Recommendations at Twitter.
AAAI
Twitter Summarization Based on Social Network and Sparse Reconstruction.
IJCAI
A Comparative Study of Transactional and Semantic Approaches for Predicting Cascades on Twitter.
Automatic Opioid User Detection from Twitter: Transductive Ensemble Built on Different Meta-graph Based Similarities over Heterogeneous Information Network.
EMNLP
Exploring Optimism and Pessimism in Twitter Using Deep Learning.
Transferring from Formal Newswire Domain with Hypernet for Twitter POS Tagging.
CIKM
Causal Dependencies for Future Interest Prediction on Twitter.
An Effective Approach for Modelling Time Features for Classifying Bursty Topics on Twitter.
ACL
ClaimPortal: Integrated Monitoring, Searching, Checking, and Analytics of Factual Claims on Twitter
A web server monitoring real things in twitter.
Multi-Modal Sarcasm Detection in Twitter with Hierarchical Fusion Model
Sarcasm detection by triple models.
Twitter Homophily: Network Based Prediction of User’s Occupation
Using GCN to exploit homophily to predict occupation.
Categorizing and Inferring the Relationship between the Text and Image of Twitter Posts
Judge four relationships of image and text.
EMNLP
Exploring Optimism and Pessimism in Twitter Using Deep Learning
NAACL
Deep Learning for Depression Detection of Twitter Users.
Only consider text of users which have completely dataset.
Main Idea:
Based on user's actions and tweets to decide whether a user has potential depression.
A step by step series of whole procedure that tell you how to finish Twitter analysis.
1. Prepare dataset. Plan to crawl 10,000 users in Twitter and 100 tweets of each user and 'following & follower' of each user.
2. Cleaning data and build relation entries of users. Delete some words like 'the', 'a' and some useless blanks. Store user information in database.
Nov. 5 Proposal
Dec. 3 Proj pre
- Crawl data : 2 person (needs same as the aaai paper data format) Cui Xu
- text model : 1 per Liu Yanyan
- graph model : 1 per Li Zenan
- follower/following : 1 per Su Linyin
- Data part : Washing data & Splicing data format (1 per)
- Define api : 1 per
- Text model coding : 1 per
- Graph model coding : 1 per
- Follower/following coding : 1 per
- We need to combine all code and fix bugs.
- Writing proposal.
- Find tricks which can improve our performances.
- Cui Mingyu
- Li Zenan
- Liu Yanyan
- Su Linyin
- Xu Jiangyue
This project is licensed under the MIT License - see the LICENSE.md file for details
- Hat tip to anyone whose code was used
- Inspiration: We need pay more attention on those who has a depression experience.