David Dale commited on
Commit
b9fc216
·
1 Parent(s): bc3a3c1

mention the code-switching

Browse files
Files changed (1) hide show
  1. leaderboard.py +4 -0
leaderboard.py CHANGED
@@ -23,6 +23,10 @@ an [ISO 639-3 code](https://en.wikipedia.org/wiki/List_of_ISO_639-3_codes) (the
23
  an [ISO 15924](https://en.wikipedia.org/wiki/ISO_15924) code (the next 4 letters) for the writing system,
24
  and optionally, a [Glottolog code](https://glottolog.org/) for the dialect.
25
 
 
 
 
 
26
  For a fuller description of the languages and the codes used to represent them, please refer
27
  to https://huggingface.co/datasets/facebook/bouquet#languages and the [BOUQuET paper](https://arxiv.org/abs/2502.04314).
28
  """
 
23
  an [ISO 15924](https://en.wikipedia.org/wiki/ISO_15924) code (the next 4 letters) for the writing system,
24
  and optionally, a [Glottolog code](https://glottolog.org/) for the dialect.
25
 
26
+ The varieties with a secondary language code (Egyptian Arabic, Colloquial Malay) use code-switching,
27
+ i.e. the speakers switch between the two languages (a colloquial and a standardized variety)
28
+ depending on the context (e.g. the formality level).
29
+
30
  For a fuller description of the languages and the codes used to represent them, please refer
31
  to https://huggingface.co/datasets/facebook/bouquet#languages and the [BOUQuET paper](https://arxiv.org/abs/2502.04314).
32
  """