Inspecting utterances#
Searching utterances#
You can search for specific utterances either by file, speaker, or text. The text search has the capability of using regular expressions. Additionally, you can use the replace field to replace all instances of a text query with another string in the corpus. Replacements can also include regular expressions references such as \1
to refer to groups in the search expression. See the Python regular expression documentation for more details.
Utterance search results#
The table of utterances contains all search results (or all utterances if there are no filters), and can be sorted according to each column. The columns contain information about whether it has an OOV (if a pronunciation dictionary has been loaded), the file, speaker, begin, end, duration, and text by default. By right clicking on the table header, additional columns can be viewed, though they are only relevant once steps like alignment, transcription, or loading ivectors have been done.
Utterance details#
The top right of the Anchor window contains the waveform and spectrogram for the currently selected utterance.
Note
See Configuring the spectrogram and Configuring pitch tracks for options related to spectrogram and pitch track display.
The bottom left of the Anchor window contains the text transcription of the currently selected utterance, along with additional tiers if steps like alignment or transcription have been performed. Right clicking on an interval (outside of any text boxes for the transcription) allows for changing the speaker of the utterance or making tiers visible/hidden.
Toolbar#
The toolbar at the bottom of the Anchor window provides a number of common actions for inspecting, editing, and transforming utterances.
The toolbar contains multiple sections for various types of common actions. Note that these actions also have keyboard shortcuts that can be viewed and customized by Configuring Anchor.
Utterance playback#
The primary action in this section is for playing/pausing the audio (/), by default the keyboard shortcut is tab
.
Editing utterances#
Utterances can be split in half (), merged into a single utterance (), or deleted ().
The advanced functionality is only available when an acoustic model and pronunciation dictionary have been loaded, as they perform alignment. The alignment action () generates an alignment for the utterance, and the segmentation action () splits the utterance based on VAD and what gets aligned in each segment VAD returns.
The alignment action is similar to MFA’s align_one command and the segmentation action is similar to MFA’s segment command.
Help#
There are two help actions in the toolbar for opening up Anchor documentation () and reporting any issues you’ve encountered while using Anchor ().