Publications
This is a searchable list of publications of scientists working at or associated with the Institute of Phonetics and Speech Processing. You can choose to sort the list by year or by publication type.
The complete list in BibTeX format can be downloaded here:
Download list of publications (bibtex)
The “Research Reports of the Institute of Phonetics and Speech Communications” (FIPKM, “Forschungseberichte des Instituts für Phonetik und Sprachliche Kommunikation“) were edited and published for 39 volumes until the series was discontinued in 2002. Some of the volumes published between 1996 and 2002 are available online. Others are available in print at request.
More information
Search
Reference
Shoemark, P., Kirby, J., Goldwater, S. (2018). Inducing a Lexicon of Sociolinguistic Variables from Code-Mixed Text. In Proceedings of the 2018 EMNLP Workshop W-NUT (pp. 1-6).
BibTeX
@inproceedings{shoemarkInducingLexiconSociolinguistic2018, title = {Inducing a Lexicon of Sociolinguistic Variables from Code-Mixed Text}, booktitle = {Proceedings of the 2018 {{EMNLP Workshop W-NUT}}}, author = {Shoemark, Philippa and Kirby, James and Goldwater, Sharon}, year = {2018}, volume = {Proceedings of the 2018 EMNLP Workshop W-NUT: The 4th Workshop on Noisy User-generated Text}, pages = {1--6}, publisher = {Association for Computational Linguistics}, address = {Brussels, Belgium}, abstract = {Sociolinguistics is often concerned with how variants of a linguistic item (e.g., nothing vs. nothin') are used by different groups or in different situations. We introduce the task of inducing lexical variables from code-mixed text: that is, identifying equivalence pairs such as (football, fitba) along with their linguistic code (football{$\rightarrow$}British, fitba{$\rightarrow$}Scottish). We adapt a framework for identifying gender-biased word pairs to this new task, and present results on three different pairs of English dialects, using tweets as the code-mixed text. Our system achieves precision of over 70\% for two of these three datasets, and produces useful results even without extensive parameter tuning. Our success in adapting this framework from gender to language variety suggests that it could be used to discover other types of analogous pairs as well.} }
Powered by bibtexbrowser