Table 5 The basic principles of measuring investor sentiment in social media.
From: Data selection and collection for constructing investor sentiment from social media
Level 1 | Level 2 | Principle No. | Concerns |
---|---|---|---|
Data source | Daily social media; Investor eCommunity | D1 | What data source do we use? What characteristics does it have? Why is this data source most relevant to our work? |
Technology framework | How to collect data | D2 | How do we access the data? How do we filter and locate the ideal data? What exactly does a basic piece of data contain? |
Technology framework | Time frame | D3 | Why is this time frame conducive to answering the research questions? Is the time frame conducive to obtaining conclusions that are in harmony with the present? If not, why? |
Technology framework | How much data to collect | D4 | Does the study use a sufficient level of data? Why is this level of data sufficient to answer the study’s questions? |
Barriers and compromises | Barriers of law and ethics; limitations of policy and technology; limitations of the collection scheme; undesirable time frame; undesirable volume of data; limitations of third-party datasets; other noise | D5 | What are the legal and ethical barriers to data collection? What are the policy and technical constraints? What are the limitations of the collection scheme? What are the flaws and shortcomings of the dataset? Does the resulting dataset explain the problem under study, and why? |