Commit b9fc216 by David Dale (1 parent: bc3a3c1)
mention the code-switching
leaderboard.py CHANGED (+4 -0)
@@ -23,6 +23,10 @@ an [ISO 639-3 code](https://en.wikipedia.org/wiki/List_of_ISO_639-3_codes) (the
 an [ISO 15924](https://en.wikipedia.org/wiki/ISO_15924) code (the next 4 letters) for the writing system,
 and optionally, a [Glottolog code](https://glottolog.org/) for the dialect.
 
+The varieties with a secondary language code (Egyptian Arabic, Colloquial Malay) use code-switching,
+i.e. the speakers switch between the two languages (a colloquial and a standardized variety)
+depending on the context (e.g. the formality level).
+
 For a fuller description of the languages and the codes used to represent them, please refer
 to https://huggingface.co/datasets/facebook/bouquet#languages and the [BOUQuET paper](https://arxiv.org/abs/2502.04314).
 """
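
The docstring in this hunk describes a language code as an ISO 639-3 code (3 letters), an ISO 15924 code (the next 4 letters), and an optional Glottolog code. A minimal sketch of splitting such a code could look like the following. It is not taken from `leaderboard.py`: the underscore separator, the `LangCode` type, and the `parse_lang_code` helper are assumptions made for illustration, and the secondary language code used by the code-switching varieties is not handled because the hunk does not show how it is written.

```python
import re
from typing import NamedTuple, Optional


class LangCode(NamedTuple):
    """Hypothetical container for the parts of a language code."""
    language: str               # ISO 639-3 code (3 lowercase letters)
    script: str                 # ISO 15924 code (4 letters, e.g. "Latn")
    glottocode: Optional[str]   # optional Glottolog code for the dialect


# Assumption: the parts are joined with underscores, e.g. "arz_Arab".
# Glottolog codes consist of 4 alphanumeric characters followed by 4 digits.
_LANG_CODE = re.compile(
    r"([a-z]{3})_([A-Z][a-z]{3})(?:_([a-z0-9]{4}[0-9]{4}))?"
)


def parse_lang_code(code: str) -> LangCode:
    """Split a code into language, script, and optional dialect parts."""
    match = _LANG_CODE.fullmatch(code)
    if match is None:
        raise ValueError(f"unrecognized language code: {code!r}")
    # An unmatched optional group comes back as None.
    return LangCode(*match.groups())
```

Under these assumptions, `parse_lang_code("arz_Arab")` yields the language `arz`, the script `Arab`, and no dialect, while a malformed string raises `ValueError` rather than being silently truncated.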