FBM3 - “Speech Understanding”

From RoCKIn Wiki
Jump to: navigation, search

Format of input audio files and output CFR format

Input: An audio file (see example) containing a set of commands will be played through a loud speaker.

Output: text file (named results.txt or <teamname>.txt) with one line for each sentence

 Command_number|command_transcription|command_frame_representation

Example of output file

 Command1|move to the living room|MOTION(goal:"living room")
 Command2|could you please find my jacket?|SEARCHING(theme:"my jacket")
 Command3|BAD_RECOGNITION|NO_INTERPRETATION

Please refer to the ERL Consumer Rulebook for more details about input and output formats.

Resources that are available to design or/and validate the processing system realized for this benchmark are listed and linked in the following.

  • additional data can be found at: http://sag.art.uniroma2.it/HuRIC.html
  • a parser that implements the CFR grammar described in the specification module and that can be used to check if the produced output follows the correct syntax can be downloaded at "Download the parser".
  • the UPDATED lexicon composing the commands that will be used for the benchmark, in terms of different grammatical categories. It is provided within a zip file called lexicon.zip ("Download lexicon.zip") containing:
    1. verbs.txt: contains all the verbs used in the commands, comprehending modal and auxiliary verbs (e.g. "can").
    2. nouns.txt: contains all the nouns present in the commands.
    3. prepositions_and_positional_adverbs.txt: contains all the prepositions and positional adverbs (e.g. "close") used in the commands.
    4. personal_pronouns.txt: contains the list of the personal pronouns used in the commands (e.g. "me").
    5. adjectives.txt: contains the adjectives used in the commands, considering also the one derived from verbs (e.g. "dining" for "dining room").
    6. others.txt: contains all the words that are not considered in the previous categories, as articles or other kind of adverbs (e.g. "carefully").

Note: these files report most of the lexicon that could be used in the commands. Some words outside this lexicon might also be used.

Example of an audio file used for the benchmark can be downloaded from: https://sites.google.com/site/erlsrlisbon/resources/test.mp3?attredirects=0&d=1