Table 5 The basic principles of measuring investor sentiment in social media.

From: Data selection and collection for constructing investor sentiment from social media

| Level 1 | Level 2 | Principle No. | Concerns |
| --- | --- | --- | --- |
| Data source | Daily social media | D1 | What data source do we use? What characteristics does it have? Why is this data source the most relevant to our work? |
| | Investor eCommunity | | |
| Technology framework | How to collect data | D2 | How do we access the data? How do we filter and locate the ideal data? What exactly does a basic piece of data contain? |
| | Time frame | D3 | Why is this time frame conducive to answering the research questions? Is the time frame conducive to obtaining conclusions that are in harmony with the present? If not, why? |
| | How much data to collect | D4 | Does the study use a sufficient level of data? Why is this level of data sufficient to answer the study's questions? |
| Barriers and compromises | Barriers of law and ethics; limitations of policy and technology; limitations of the collection scheme | D5 | What are the legal and ethical barriers to data collection? What are the policy and technical constraints? What are the limitations of the collection scheme? What are the flaws and shortcomings of the dataset? Does the dataset created explain the problem under study, and why? |
| | Undesirable time frame; undesirable volume of data; limitations of third-party datasets; other noise | | |

Note: This table summarizes the principles of the data collection framework explored in the Results and Discussion sections.