Fishing in Speech Stream

Angling for a Lexicon

Peter Juel Henrichsen

    Research output: Contribution to journalConference article in journalResearchpeer-review

    Abstract

    We present a learning device able to deduce a set of Danish color and shape terms. Only two data sources are available to the learner: A phonetic transcription of a human informant solving a description task, and a minimal formal model of the picture being described. The system thus contains no preconceived lexical, morphological, or semantic categories. The test data are from the phonetic corpus DanPASS, a standard Danish reference corpus. The learning device, called InShape-2, is an early result of an ambitious research programme at CMOL on data-driven language learning.
    Original languageEnglish
    JournalNEALT (Northern European Association of Language Technology) Proceedings Series
    Issue number11
    Pages (from-to)90-97
    Number of pages8
    ISSN1736-6305
    Publication statusPublished - 2011
    EventNODALIDA 2011. The 18th Nordic Conference of Computational Linguistics - Riga, Latvia
    Duration: 11 May 201113 May 2011
    Conference number: 18
    http://www.lumii.lv/nodalida2011/

    Conference

    ConferenceNODALIDA 2011. The 18th Nordic Conference of Computational Linguistics
    Number18
    CountryLatvia
    CityRiga
    Period11/05/201113/05/2011
    Internet address

    Cite this

    @inproceedings{b81fc04acf664e4e947b0faa83731daf,
    title = "Fishing in Speech Stream: Angling for a Lexicon",
    abstract = "We present a learning device able to deduce a set of Danish color and shape terms. Only two data sources are available to the learner: A phonetic transcription of a human informant solving a description task, and a minimal formal model of the picture being described. The system thus contains no preconceived lexical, morphological, or semantic categories. The test data are from the phonetic corpus DanPASS, a standard Danish reference corpus. The learning device, called InShape-2, is an early result of an ambitious research programme at CMOL on data-driven language learning.",
    author = "{Juel Henrichsen}, Peter",
    year = "2011",
    language = "English",
    pages = "90--97",
    journal = "NEALT (Northern European Association of Language Technology) Proceedings Series",
    issn = "1736-6305",
    number = "11",

    }

    Fishing in Speech Stream : Angling for a Lexicon. / Juel Henrichsen, Peter.

    In: NEALT (Northern European Association of Language Technology) Proceedings Series, No. 11, 2011, p. 90-97.

    Research output: Contribution to journalConference article in journalResearchpeer-review

    TY - GEN

    T1 - Fishing in Speech Stream

    T2 - Angling for a Lexicon

    AU - Juel Henrichsen, Peter

    PY - 2011

    Y1 - 2011

    N2 - We present a learning device able to deduce a set of Danish color and shape terms. Only two data sources are available to the learner: A phonetic transcription of a human informant solving a description task, and a minimal formal model of the picture being described. The system thus contains no preconceived lexical, morphological, or semantic categories. The test data are from the phonetic corpus DanPASS, a standard Danish reference corpus. The learning device, called InShape-2, is an early result of an ambitious research programme at CMOL on data-driven language learning.

    AB - We present a learning device able to deduce a set of Danish color and shape terms. Only two data sources are available to the learner: A phonetic transcription of a human informant solving a description task, and a minimal formal model of the picture being described. The system thus contains no preconceived lexical, morphological, or semantic categories. The test data are from the phonetic corpus DanPASS, a standard Danish reference corpus. The learning device, called InShape-2, is an early result of an ambitious research programme at CMOL on data-driven language learning.

    M3 - Conference article in journal

    SP - 90

    EP - 97

    JO - NEALT (Northern European Association of Language Technology) Proceedings Series

    JF - NEALT (Northern European Association of Language Technology) Proceedings Series

    SN - 1736-6305

    IS - 11

    ER -