Table 6 Data dictionary for the sibilant dataset.

From: Polish multichannel audio-visual child speech dataset with double-expert sigmatism diagnosis

Variable

Description

Format, options

word

Parent word containing the sibilant.

Char; Polish word or logotome with removed diacritics. The only exception from this rule is the logotome “drza”, which is originally “dża” () in Polish, but would be undistinguishable without diacritic from “dza” ().

sibilant

Sibilant Latin code.

A one-, two- or three-character code for the sibilant: s – /s/ (s), z – /z/ (z), c (c), dz (dz), si (ś), zi (ź), ci (ć), dzi (dź), sz (sz), rz (ż), cz (cz), drz (dż).

stress

Placement of stress on the syllable containing the sibilant.

Boolean: 1 – stress, 0 – no stress.

wordPosition

Sibilant position within the word.

Integer: 1 – initial, 2 – medial, 3 – final position.

precedingPhone

Phoneme before the sibilant.

Char; “-” if the sibilant opens the word.

followingPhone

Phoneme after the sibilant.

Char; “-” if the sibilant closes the word.

syllableCount

Number of word syllables.

Integer.

wordEN

English translation of the word.

Char.

wordPL

Polish form of the word with diacritics.

Char, UTF-8 encoding.

wordIPA

IPA transcription of the word.

Char, UTF-8 encoding.

sibilantIPA

IPA transcription of the sibilant.

Char, UTF-8 encoding.