
Language + script code request (+ regions) for lexemes + monolingual text: sw-arab, sw-arab-cd, sw-arab-mz
Open, Needs Triage, Public, Feature

Description

This request is to add a tag for labeling Swahili text in the Arabic script on lexemes and monolingual text. Some lexeme examples:

https://www.wikidata.org/wiki/Lexeme:L6800

https://www.wikidata.org/wiki/Lexeme:L8402

https://www.wikidata.org/wiki/Lexeme:L221507

The Arabic-script spelling of Swahili varies somewhat between the Democratic Republic of the Congo (cd) and Mozambique (mz), so I am also requesting regional variant codes for these. Sources documenting the variation:

Congo: https://brill.com/display/book/edcoll/9789004256804/B9789004256804_015.xml

Mozambique: https://www.scielo.org.za/scielo.php?pid=S0041-476X2008000100008&script=sci_arttext
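For illustration, here is a minimal sketch of how the three requested codes break down into language, script, and region subtags. It assumes the codes follow the simple BCP 47 shape language-script-region; `TAG_RE` and `split_tag` are hypothetical names made up for this example, not part of Wikidata or MediaWiki, and the pattern is deliberately not a full BCP 47 parser.

```python
import re

# Simplified shape only: language (2-3 letters), optional script (4 letters),
# optional region (2 letters or 3 digits). Hypothetical helper, not a full
# BCP 47 parser and not taken from any Wikimedia code base.
TAG_RE = re.compile(
    r"^(?P<language>[a-z]{2,3})"
    r"(?:-(?P<script>[a-z]{4}))?"
    r"(?:-(?P<region>[a-z]{2}|[0-9]{3}))?$",
    re.IGNORECASE,
)


def split_tag(tag):
    """Split a simple language-script-region tag into its subtags."""
    m = TAG_RE.match(tag)
    if m is None:
        raise ValueError(f"not a simple language-script-region tag: {tag!r}")
    script = m.group("script")
    region = m.group("region")
    return {
        # Conventional BCP 47 casing: lowercase language, title-case script,
        # uppercase region. Tags themselves are case-insensitive.
        "language": m.group("language").lower(),
        "script": script.title() if script else None,
        "region": region.upper() if region else None,
    }


# The three codes requested in this task.
for tag in ("sw-arab", "sw-arab-cd", "sw-arab-mz"):
    print(tag, split_tag(tag))
```

The same helper also accepts a code such as swc-arab, which would put the distinction in the language subtag rather than in the region subtag.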

Event Timeline

"Swahili as used in the Democratic Republic of the Congo (cd)" seems to have its own language code, swc, per https://en.wikipedia.org/wiki/Congo_Swahili

Can you confirm whether "swc" would be a better code than "sw-arab-cd"? Or perhaps "swc-arab"?

I am unsure, so I am asking @Amire80 for help. I do not want to introduce codes that are wrong or very unusual.

There are also wmw and ymk, which may work for Mozambique, but I don't actually know which ones to use.

Luckily, we have @jhsoby in the Language committee, and he happens to know some things about Swahili. Perhaps he can help better than I can?

On the Swahili documents in Arabic script from the Congo (19th century):
https://d-nb.info/123807765X/34

pg. 23 “It seems that all the Congolese Swahili documents were written in Kiunguja, the Swahili spoken in Zanzibar. None of the documents show any trace of Kingwana – the variety of Swahili which was progressively used as a lingua franca between the traders, the local population and the Europeans in parts of the Congo. This sounds quite logical, since the authors of these documents all came from Zanzibar or settlements like Ujiji and Tabora which had been founded by Zanzibari merchants. The use of Kiunguja in the Muslim Swahili community of the Congo seems to have persisted afterwards. In 1956, Harries reported that there was a big difference between Kingwana, spoken as a lingua franca by the Congolese indigenous populations, and the Swahili spoken by the Muslim population of Swahili descent, which was very close to Standard East African Swahili (Harries 1956: 396).”

It seems that the code swc is intended for Kingwana/Kitanga Swahili, which differs substantially from standard Swahili. However, I intend the codes requested here for what the quote above describes: standard/Zanzibari Swahili as used in the Congo.