ConScript Unicode Registry

The ConScript Unicode Registry is a volunteer project to coordinate the assignment of code points in the Unicode Private Use Areas (PUA) for the encoding of artificial scripts, such as those for constructed languages.[1] It was founded by John Cowan and was maintained by him and Michael Everson. It is not affiliated with the Unicode Consortium.[2]

History

edit

The ConScript Unicode Registry is a volunteer project[3] that was founded by John Cowan[4] in the early 1990s. It is a joint project of John Cowan and Michael Everson.[4]

Historically, it was hosted on both Cowan and Everson's websites (branded as the North American and European sites, respectively); in 2002, the site was transitioned to be hosted exclusively on Everson's site.[5][6]

Since 2008, the ConScript Unicode Registry has been largely unmaintained; in 2008, Cowan explained that Everson was too busy to continue maintaining the project.[7] Due to this inactivity, Rebecca Bettencourt founded the Under-ConScript Unicode Registry, aiming to coordinate code points for constructed languages until they can be formally added to the ConScript Unicode Registry.[8] Scripts added to the Under-ConScript Unicode Registry include Sitelen Pona (for Toki Pona)[9] and Cirth.[10]

Scripts

edit

The CSUR and UCSUR include the following scripts:[1][11][8]

  Withdrawn
  Allocated only in the UCSUR
Scripts in the ConScript Unicode Registry[1][11] and Under-ConScript Unicode Registry[8]
Writing system Creator(s) Code range Notes
Aiha Ursula K. Le Guin F8A0–F8CF
Alzetjan Herman Miller E550–E57F
Amlin Thomas Thurman E6D0−E6EF
Amman-Iar David Bell E2A0–E2CF
Ath Morioka Hiroyuki F4C0−F4EF
aUI John W. Weilgart E280–E29F
Aurebesh Stephen Crane E890−E8DF
Braille Louis Braille EB60−EB9F The Braille Extended block includes all permutations of all Braille patterns as specified in the original 1829 specification, but not otherwise encoded in the Braille Patterns Unicode block.[12]
Cirth J. R. R. Tolkien E080–E0FF Unicode proposal made in September 1997[13]
No action taken but Cirth appears in the Roadmap to the SMP[14]
Cistercian numerals EBA0−EBDF
Cylenian Danae Dekker EC00−EC2F
Deini Dana Nutter ED00−ED3F
Derani Hoemaı F16B0−F16DF
Deseret E830–E88F Withdrawn after inclusion in Unicode. Use the corresponding Unicode block (U+10400–U+1044F).
D'ni Richard A. Watson E830−E88F
Engsvanyáli M. A. R. Barker E100–E14F
Ewellic Doug Ewell E680–E6CF
Ferengi Timothy Miller E600–E62F
Gargoyle Herman Miller E5C0–E5DF
Glaitha-A Rebecca G. Bettencourt E900−E97F
Glaitha-B Rebecca G. Bettencourt E980−E9FF
Graflect Aaron Paterson EC70−ECEF
Grawlixes Mort Walker EDF0−EDFF
Ilianore Jeff Smith E1B0–E1CF
Iranic Ali Hossein Mohammadi ED80−EDAF
Kamakawi David J. Peterson F000−F1FF
Kazat ?Akkorou Herman Miller E430–E44F
Kazvarad Herman Miller E450–E46F
Kelwathi Herman Miller E4F0–E4FF
Kinya Maurizio M. Gavoli E150–E1AF
Kinya

llables

Maurizio M. Gavoli F0000–F0E6F
Klingon Astra Image Corporation F8D0–F8FF Unicode proposal made in 1997[15]
Formally rejected by Unicode in 2001[16][17]
Lapointe Hexadecimal Numerals Boby Lapointe EBE0−EBEF
Lhenazi Arthaey Angosii EA00−EA9F
Martin Hexadecimal Numerals Bruce Alan Martin EBF0−EBFF
Mizarian Herman Miller E300–E33F
Monofon Steve T. Bell E800–E82F
Niji Norbert Lindenberg ED40−ED7F
Nísklôz Herman Miller E400–E42F
Olaetyan Herman Miller E3B0–E3FF
Ophidian E5E0–E5FF
Orokin Digital Extremes EB00−EB3F
Phaistos Disc E6D0–E6FF Withdrawn after inclusion in Unicode. Use the corresponding Unicode block (U+101D0–U+101FF).
Pikto John E. Williams F0E70–F16A4
Røzhxh Herman Miller E490–E4BF
Rynnan Herman Miller E520–E54F
Sadalian Wong Sadale Cho Ching F2000−F267F
Saklor Herman Miller E500–E51F
Sarkai Herman Miller E360–E37F
Serivelna Herman Miller E4C0–E4EF
Seussian Latin Extensions Theodor Geisel E630–E64F
Shavian Ronald Kingsley Read E700–E72F Withdrawn after inclusion in Unicode. Use the corresponding Unicode block (U+10450–U+1047F).
Semtog Quefei Huang F1B00−F1C3F
Sitelen Pona Sonja Lang F1900−F19FF
Sitelen Pona Radicals Sonja Lang et al. F1C80−F1C9F
Solresol Jean François Sudre E770–E77F
Ssûraki Herman Miller E5A0–E5BF
Standard Galactic Tom Hall EB40−EB5F
Streich Tommaso Donnarumma E2D0–E2DF
Syai Brian Lilburn E1D0–E1FF
Sylabica Marcin Kowalczyk EC30−EC6F
Syrrin Danae Dekker EC00−EC2F
Tassarunese Narylis EDB0−EDDF
Telarasso Herman Miller E580–E59F
Tengwar J. R. R. Tolkien E000–E07F Unicode proposal made in September 1997.[18] No action taken but Tengwar appears in the Roadmap to the SMP.[14]
Thelwik Herman Miller E380–E3AF
Titi Pula Ðoom Epictooþ F1C40−F1C7F
Tonal John W. Nystrom E8E0−E8FF
Unifon John Malone E740–E76F In UCSUR, the block Unifon Extended was added in the code range U+E6F0–U+E73F.[19]
Verdurian Mark Rosenfelder E200–E26F
Visible Speech Alexander Melville Bell E780–E7FF
Wanya Max Dominik Weber EAA0−EAFF
Whitaker Hexadecimal Numerals Ronald O. Whitaker ECF0-ECFF
Xaîni Paul Blake E2E0–E2FF
Zarkhánd Herman Miller E470–E48F
Zese Jack Eisenmann EDE0−EDEF
Zírí:Nka Herman Miller E340–E35F

Font support

edit

Some fonts support ConScript Unicode specified code points:

  • Constructium, a proportional font based on SIL Gentium[20]
  • Fairfax, a monospaced font family intended for text editors and terminals[21]
  • GNU Unifont, a bitmap font intended as a fallback font, includes CSUR and UCSUR characters in the separate Unifont CSUR package[10]
  • Horta[22]
  • Kurinto Font Folio[23]
  • Nishiki-teki[24]

See also

edit

References

edit
  1. ^ a b c "ConScript Unicode Registry". Evertype.com. Archived from the original on 2015-06-22. Retrieved 2015-06-20.
  2. ^ "FAQ - Private-Use Characters, Noncharacters, and Sentinels". Unicode Consortium. Retrieved 7 August 2023. As an example, see the ConScript Unicode Registry—a private group unaffiliated with the Unicode Consortium—which has extensive tables listing private-use character definitions for various unencoded scripts.
  3. ^ Scott, Michael Lee (2009). Programming Language Pragmatics (3rd ed.). Morgan Kaufmann. p. 295. ISBN 978-0-12-374514-9. It also sets aside a "private use area" for such artificial [constructed] languages as Klingon, Tengwar, and Cirth [Tolkien Elvish]. Allocation of this private space is coordinated by a volunteer organization known as the ConScript Unicode Registry.
  4. ^ a b Cowan, John; Everson, Michael (2023-04-27). "ConScript Unicode Registry". ConScript Unicode Registry. Evertype. Retrieved 2023-08-09. The Registry is the joint project of John Cowan and Michael Everson. John originated the Registry and did most of the work of collecting information and writing preliminary proposals; Michael has been reviewing and revising these proposals into final registrations.
  5. ^ "Wayback Machine comparison (2002-01-13 vs 2002-06-15)". ConScript Unicode Registry.
  6. ^ Cowan, John. "John Cowan". Chester County InterLink. Archived from the original on 2002-07-24.
  7. ^ Cowan, John (2008-02-04). "Re: CSUR help". KreativeKorp.
  8. ^ a b c Bettencourt, Rebecca G. "Under-ConScript Unicode Registry". Archived from the original on 2015-07-18. Retrieved 2015-07-21.
  9. ^ jan Ke Tami (2022-09-01). "nasin Juniko ala la, nasin UCSUR li lon" (PDF). Lipu Tenpo (in Toki Pona) (moku): 2–3. ISSN 2752-4639.
  10. ^ a b "GNU Unifont". Unifoundry.com. Archived from the original on 2013-11-12. Retrieved 2015-07-24.
  11. ^ a b Everson, Michael (21 March 2008). "Roadmap to the ConScript Unicode Registry". Evertype. Archived from the original on 17 July 2021. Retrieved 30 July 2021.
  12. ^ User:Vanisaac. "Braille Extended" (PDF). Retrieved 2024-01-14.
  13. ^ Everson, Michael (1997-09-18). "N1642: Proposal to encode Cirth in Plane 1 of ISO/IEC 10646-2". Working Group Document, ISO/IEC JTC1/SC2/WG2 and UTC. Archived from the original on 2015-07-29. Retrieved 2015-07-20.
  14. ^ a b "Roadmap to the SMP". Unicode.org. 2015-06-03. Archived from the original on 2015-06-23. Retrieved 2015-06-20.
  15. ^ Everson, Michael (1997-09-18). "N1643: Proposal to encode Klingon in Plane 1 of ISO/IEC 10646-2". Working Group Document, ISO/IEC JTC1/SC2/WG2 and UTC. Archived from the original on 2015-07-29. Retrieved 2015-07-20.
  16. ^ "N2370: Unicode Consortium Liaison Report". The Unicode Consortium. 2001-10-10. Archived from the original on 2015-07-31. Retrieved 2015-07-20.
  17. ^ "Archive of Notices of Non-Approval". The Unicode Consortium. Archived from the original on 2015-07-29. Retrieved 2015-07-20.
  18. ^ Everson, Michael (1997-09-18). "N1641: Proposal to encode Tengwar in Plane 1 of ISO/IEC 10646-2". Working Group Document, ISO/IEC JTC1/SC2/WG2 and UTC. Archived from the original on 2015-07-29. Retrieved 2015-07-20.
  19. ^ "Unifon Extended: U+E6F0 - U+E73F". KreativeKorp. Retrieved 2024-01-14.
  20. ^ Bettencourt, Rebecca. "Constructium". Kreative Korporation. Archived from the original on 13 July 2011. Retrieved 28 June 2017.
  21. ^ Bettencourt, Rebecca. "Fairfax". Kreative Korporation. Archived from the original on 23 October 2021. Retrieved 5 November 2021.
  22. ^ "Horta is a remarkably intelligent and sensitive typeface". Christ Trek Fonts. Archived from the original on 2019-10-31. Retrieved 2019-10-31.
  23. ^ "Kurinto Font Folio". Archived from the original on 2020-07-26. Retrieved 2020-07-27.
  24. ^ Niwatori, Shiroi. "にしき的フォント" (in Japanese). Retrieved 2024-01-14.
edit