[AUDITORY] AW: [AUDITORY] Tool for automatic syllable segmentation

Subject: [AUDITORY] AW: [AUDITORY] Tool for automatic syllable segmentation

From: "Moshona, Cleopatra Christina" <0000033a8625d6d6-dmarc-request@xxxxxxxxxxxxxxx>

Date: Fri, 20 Sep 2024 07:39:21 +0000

Accept-language: de-DE, en-US

Arc-authentication-results: i=1; mx.google.com; dkim=pass header.i=@LISTS.MCGILL.CA header.s=SELECTOR1 header.b=qtzkRUC0; spf=pass (google.com: domain of owner-auditory@xxxxxxxxxxxxxxx designates 132.206.27.103 as permitted sender) smtp.mailfrom=owner-auditory@xxxxxxxxxxxxxxx; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=mcgill.ca

Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20240605; h=list-archive:list-owner:list-subscribe:list-unsubscribe:list-help :precedence:in-reply-to:to:comments:subject:from:sender:reply-to :date:message-id:mime-version:content-language:accept-language :references:thread-index:thread-topic:approved-by:dkim-signature; bh=aKMHkvjX7/EHtY7kmNcdICqaT7z5cZPkJkWnMO4wSEw=; fh=5/42mu9FVmfuMp6n0xGXVcDar2H3ENcHt8Uv11Om8gY=; b=ADOmAV3gxhnCYWbRS6oef4VJg67NktqFBACCJRWJ/NIw0T14zQpEVqe4YDXsYNf0nV p2PSNjcQQ2iT2DrxUIpSYNFagP272v++diIFLxw+WmILzPE1idbmSxUwiuOouD3p5LuQ ZAh4kDHxPuFSU4BEGFqLju0LFrM5sEM9Dt0Af5NV8hHSqib796gcHrltn66k7oMYI0KD zWQiU6bIlHbejW7jTU3yN4VtSfiaTLwDbKMlicx3BP+84I8UN1MYhMMH2mXU0DjInykx SaEzV6MqtSkRcnO/LAgdBk/+YAZdPLwNCXv1DBTOFMKE/8zbWX1n9tyPGjrrPdyyhtZr SV6Q==; dara=google.com

Arc-seal: i=1; a=rsa-sha256; t=1726821763; cv=none; d=google.com; s=arc-20240605; b=eUPfcn/CLdfWXORSvnV97iPZO2rfSF8YjWpA2fZPQz4jyjbxrnzDh1xbhJhpNXYgf/ 7O9bJSrrIcfg/+Ujhsm3m+lvKq+pQX4j0HjzfkHs2Z2XQEJ6nbNGoika33pkxn4iuQgL 6WtyrjH6lwCnTWQ+B9KYLc+J63vhdq6YkVtRkt5A+ASgXf9SF8ImTHT8d0p0fhoqgiFe afZN7Upn5A289bDt1kuRxu30RM7BhBJc00JK/acMF7nJei14FQJ9hyzR24MXnXwnjnxO nBp0XhogioQzQwr72ziyNZucGNKyJigl5zE7rW2gESZ1otpM9y9j6ezsT2ZSc7Uly1ab fCWg==

Authentication-results: mx.google.com; dkim=pass header.i=@LISTS.MCGILL.CA header.s=SELECTOR1 header.b=qtzkRUC0; spf=pass (google.com: domain of owner-auditory@xxxxxxxxxxxxxxx designates 132.206.27.103 as permitted sender) smtp.mailfrom=owner-auditory@xxxxxxxxxxxxxxx; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=mcgill.ca

Comments: To: Rémy MASSON <remy.masson@xxxxxxxxxx>

Delivered-to: dan.ellis@xxxxxxxxx

Dkim-signature: v=1; a=rsa-sha256; d=LISTS.MCGILL.CA; s=SELECTOR1; c=relaxed/relaxed; bh=aKMHkvjX7/EHtY7kmNcdICqaT7z5cZPkJkWnMO4wSEw=; i=@LISTS.MCGILL.CA; h=Approved-By:Thread-Topic:Thread-Index:References:Accept-Language:Content-Language:Content-Type:MIME-Version:Message-ID:Date:Reply-To:Sender:From:Subject:To:In-Reply-To:List-Help:List-Unsubscribe:List-Subscribe:List-Owner:List-Archive; b=qtzkRUC0Qm0LllEoqQbbBaJnvsv0AH/SJCONCPHy60kTImGDSL4ZrVe/+6onSAqtKXf7XtU327BdHNOLR+gEIWcoPlC33oF+e1HrvXc7o3cG2WTaVcC23HEzqtwIzN9EKink/0xdNCr1+coLESrn4EfVTAYGteiziiQmv1l9oRIwCIxgKu3HeJ4IqJWSrZBvDg558nYOQXd2lHyjsCpKXXkWeug0Z0GgiyvoCHK028dAyFwVaAv98eKYdprTJGkXZ7LUmgXYti108xHMLf2sZi6rgOskdaXkN+RkIMxI0YbywHBI9+JulQd/+PBC1/yt9cauqVjwAY6ouFikHuHI4A==

In-reply-to: <1e5ea9e398c44e609f9053065a7642d3@pasteur.fr>

List-archive: <https://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>

List-help: <https://lists.mcgill.ca/scripts/wa.exe?LIST=AUDITORY>, <mailto:LISTSERV@LISTS.MCGILL.CA?body=INFO%20AUDITORY>

List-owner: <mailto:AUDITORY-request@LISTS.MCGILL.CA>

List-subscribe: <mailto:AUDITORY-subscribe-request@LISTS.MCGILL.CA>

List-unsubscribe: <mailto:AUDITORY-unsubscribe-request@LISTS.MCGILL.CA>

References: <1e5ea9e398c44e609f9053065a7642d3@pasteur.fr>

Reply-to: "Moshona, Cleopatra Christina" <c.moshona@xxxxxxxxxxxx>

Sender: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx>

Thread-index: AdsJyXDrdP2YXbaYT0+Eoe1I3ivV5wBZBQWQ

Thread-topic: [AUDITORY] Tool for automatic syllable segmentation

Dear Rémy,

I would recommend the following workflow:

Use the G2P→ MAUS→ PHO2SYL pipeline in webMaus and select TextGrid as an output. If you do not use ASR services, you will need to upload an orthographic transcript of the syllables along with each sound file. This is especially recommended if the syllables are nonsensical. The pipeline results in several layers, one of which is MAS, containing the syllabified chain.
The output of webMaus (especially if no ASR service is used) can be somewhat unprecise, because it is based on HMM probabilities. It is therefore a wise choice to manually review and adjust the syllable segmentation (drag the boundaries) in the Praat TextGrid.
You can then use a script in Praat to automatically count your interval tiers on the MAS layer. This will give you the correct number of syllables. There are several freely available scripts flying around to help you automate the process. You can also use the Python-version of Praat: Parslemouth, which is a bit more flexible in combination with other Python libraries.

Hope this helps.

Best regards,

Cleo

________________________________________________

Cleopatra Christina Moshona, M.A., M.A.

Research Associate

Technische Universität Berlin

Faculty V – Mechanical Engineering and Transport Systems

Institute of Fluid Dynamics and Technical Acoustics

Engineering Acoustics - Psychoacoustics Group

Room: HFT-TA 438
Telephone.: +49 (0)30 314-70437

https://www.tu.berlin/akustik

Von: AUDITORY - Research in Auditory Perception <AUDITORY@xxxxxxxxxxxxxxx> im Auftrag von Rémy MASSON <remy.masson@xxxxxxxxxx>
Gesendet: Mittwoch, 18. September 2024 17:52:47
An: AUDITORY@xxxxxxxxxxxxxxx
Betreff: [AUDITORY] Tool for automatic syllable segmentation

Hello AUDITORY list,

We are attempting to do automatic syllable segmentation on a collection of sound files that we use in an experiment. Our stimuli are a rapid sequence of syllables (all beginning with a consonant and ending with a vowel) with no underlying semantic meaning and with no pauses. We would like to automatically extract the syllable/speech rate and obtain the timestamps for each syllable onset.

We are a bit lost on which tool to use. We tried PRAAT with the Syllable Nuclei v3 script, the software VoiceLab and the website WebMaus. Unfortunately, for each of them their estimation of the total number of syllables did not consistently match what we were able to count manually, despite toggling with the parameters.

Do you have any advice on how to go further? Do you have any experience in syllable onset extraction?

Thank you for your understanding,

Rémy MASSON

Research Engineer

Laboratory "Neural coding and neuroengineering of human speech functions" (NeuroSpeech)

Institut de l’Audition – Institut Pasteur (Paris)

Accueil | Institut de l'audition