Event Detail

Dept Tea: On the Characteristics of Language Tags on the Web

Start: Tuesday, February 27, 2018, 11:30 a.m.
End: Tuesday, February 27, 2018, 12:20 p.m.
Location: 319 McGregory Hall

Speaker: Prof. Joel Sommers

Abstract: To support broad use of Internet applications such as the World Wide Web, character encodings have been developed for many scripts of the world's languages, and there are standard mechanisms for indicating that content is in a particular language and/or tailored to a particular region. In this talk, I'll present research on the empirical characteristics of language tags found on web pages and in HTTP transactions. The data used in our study was collected from the top-level pages of the Alexa Top 1 Million, from six geographic perspectives. I'll discuss the findings of our analyses and describe the tools and algorithms developed through our work.

This is (in part) a practice talk for the Passive and Active Measurement Conference to be held in late March.