Fig. 1: NLP-mixed-methods pipeline for extracting and analyzing psychotherapy dissatisfaction.
From: Exploring negative experiences in psychotherapy using an NLP approach on online forum data

Blue rounded boxes represent preprocessing steps, while icons indicate the respective task. NLP refers to natural language processing. Forum posts were anonymized and aggregated into chunks, defined as consecutive aggregated posts and comments from the same Reddit user. Tokens are the basic text units processed by the model, typically corresponding to short words or word fragments. Chunks classified as relevant were further divided into text passages. A text passage is a segment within a chunk that specifically relates to dissatisfaction with psychotherapy. Subsequent steps included meta cluster assignment, manual inspection and labeling of clusters, topic modeling, user-level analysis, and sentiment analysis. Icons sourced from Flaticon.com.