5 a @ - - PowerPoint PPT Presentation

5 a 6
SMART_READER_LITE
LIVE PREVIEW

5 a @ - - PowerPoint PPT Presentation

5 a @ 6 Domain Name in Pakistani Languages a h:


slide-1
SLIDE 1

۔ ê 5 aا ñ ں@ز 6†

  • ،ñ a8

ہa نh: ،ا: ر

  • ا : فآ ر@ 6

hا ،a

  • ¢

Domain Name in Pakistani Languages

slide-2
SLIDE 2

www.crulp.org 2

Domain Name Domain Name

slide-3
SLIDE 3

www.crulp.org 3

Internationalized Domain Name Internationalized Domain Name

slide-4
SLIDE 4

www.crulp.org 4

What letters of Pakistani Languages What letters of Pakistani Languages should be allowed in the should be allowed in the Internationalized Domain Names Internationalized Domain Names (IDNs)? (IDNs)?

  • For each language?

For each language?

  • Collectively?

Collectively?

slide-5
SLIDE 5

www.crulp.org 5

Morning Session Morning Session

  • Background: Unicode

Background: Unicode

  • Internationalized Domain Names (IDNs)

Internationalized Domain Names (IDNs)

  • Issues and challenges related to Arabic IDNs

Issues and challenges related to Arabic IDNs

  • Sample (tentative) solution for Urdu language

Sample (tentative) solution for Urdu language

slide-6
SLIDE 6

www.crulp.org 6

Afternoon Session Afternoon Session

  • Sample language tables for the following languages

Sample language tables for the following languages

  • Balochi

Balochi

  • Pashto

Pashto

  • Punjabi

Punjabi

  • Seraiki

Seraiki

  • Sindhi

Sindhi

  • Torwali

Torwali

  • Collective Issues for multiple languages

Collective Issues for multiple languages

slide-7
SLIDE 7

www.crulp.org 7

Background: Unicode Background: Unicode

  • Everything in the computers is represented as

Everything in the computers is represented as numbers numbers

  • Initially ASCII encoding:

Initially ASCII encoding:

  • A

A 65 65

  • B

B 66 66 … …

  • Only supported Latin script, primarily English

Only supported Latin script, primarily English

  • Other encodings developed for other languages,

Other encodings developed for other languages, but cumbersome to develop separate encoding but cumbersome to develop separate encoding for each language of the world for each language of the world

slide-8
SLIDE 8

www.crulp.org 8

Unicode Unicode

  • Thus effort started to develop Universal encoding or

Thus effort started to develop Universal encoding or UNIcode UNIcode

  • Unicode Consortium develops the Unicode standard

Unicode Consortium develops the Unicode standard

  • Covers almost all writing systems in current use today

Covers almost all writing systems in current use today

  • First version

First version ‘ ‘The Unicode Standard 1.0 The Unicode Standard 1.0’ ’ published in published in 1991 1991

  • Current version

Current version ‘ ‘The Unicode Standard 5.1 The Unicode Standard 5.1’ ’ published in published in April 2008 April 2008

slide-9
SLIDE 9

www.crulp.org 9

Unicode Unicode

  • European scripts

European scripts

  • Latin, Greek, Cyrillic, Armenian, Georgian, IPA

Latin, Greek, Cyrillic, Armenian, Georgian, IPA

  • Bidirectional (Middle Eastern) scripts

Bidirectional (Middle Eastern) scripts

  • Hebrew, Arabic,

Hebrew, Arabic, Syriac Syriac, Thaana , Thaana

  • Indic (Indian and Southeast Asian) scripts

Indic (Indian and Southeast Asian) scripts

  • Devanagari, Bengali,

Devanagari, Bengali, Gurmukhi Gurmukhi, Gujarati, Oriya, Tamil, , Gujarati, Oriya, Tamil, Telugu, Kannada, Malayalam, Sinhala, Thai, Lao, Khmer, Telugu, Kannada, Malayalam, Sinhala, Thai, Lao, Khmer, Myanmar, Tibetan, Philippine Myanmar, Tibetan, Philippine

  • East Asian scripts

East Asian scripts

  • Chinese (Han) characters, Japanese (Hiragana and Katakana),

Chinese (Han) characters, Japanese (Hiragana and Katakana), Korean (Hangul), Yi Korean (Hangul), Yi

slide-10
SLIDE 10

www.crulp.org 10

Unicode Unicode

  • Other modern scripts

Other modern scripts

  • Mongolian, Ethiopic, Cherokee, Canadian

Mongolian, Ethiopic, Cherokee, Canadian Aboriginal Aboriginal

  • Historical scripts

Historical scripts

  • Runic,

Runic, Ogham Ogham, Old Italic, Gothic, , Old Italic, Gothic, Deseret Deseret

  • Punctuation and symbols

Punctuation and symbols

  • Numerals, math symbols, scientific symbols, arrows,

Numerals, math symbols, scientific symbols, arrows, blocks, geometric shapes, Braille, musical notation, blocks, geometric shapes, Braille, musical notation, etc. etc.

slide-11
SLIDE 11

www.crulp.org 11

Unicode is SCRIPT based Unicode is SCRIPT based

  • One code per character per script

One code per character per script

  • To avoid duplication of codes of same letter used by

To avoid duplication of codes of same letter used by multiple scripts multiple scripts

  • For example:

For example:

  • The character code U+06A9

The character code U+06A9 کک is same in Urdu, Sindhi, is same in Urdu, Sindhi, Pashto, Punjabi, Farsi, Pashto, Punjabi, Farsi, … …

  • Different code blocks reserved for different

Different code blocks reserved for different scripts scripts

  • For Arabic script 0600, 0601,

For Arabic script 0600, 0601, … …, 06FE, 06FF , 06FE, 06FF

slide-12
SLIDE 12

www.crulp.org 12

Characters Semantics Characters Semantics

  • The Unicode standard includes an extensive database

The Unicode standard includes an extensive database that specifies a large number of that specifies a large number of character properties, character properties, including: including:

  • Name

Name

  • Type (e.g., letter, digit, punctuation mark)

Type (e.g., letter, digit, punctuation mark)

  • Decomposition

Decomposition

  • Case and case mappings (for cased letters)

Case and case mappings (for cased letters)

  • Numeric value (for digits and numerals)

Numeric value (for digits and numerals)

  • Combining class (for combining characters)

Combining class (for combining characters)

  • Cursive joining behavior

Cursive joining behavior

slide-13
SLIDE 13

www.crulp.org 13

Unicode Unicode

  • Adopted by industry leaders as Apple, HP, IBM,

Adopted by industry leaders as Apple, HP, IBM, Microsoft, etc. Microsoft, etc.

  • Supported in many platforms including Java,

Supported in many platforms including Java, Linux and Microsoft Windows, etc. Linux and Microsoft Windows, etc.

  • Supported by many internationalized

Supported by many internationalized applications including Open Office, applications including Open Office, Firefox Firefox, , Thunderbird, Microsoft Office, etc. Thunderbird, Microsoft Office, etc.

slide-14
SLIDE 14

www.crulp.org 14

Unicode is the basis for Unicode is the basis for Internationalized Domain Names Internationalized Domain Names

slide-15
SLIDE 15

www.crulp.org 15

Morning Session Morning Session

  • Background: Unicode

Background: Unicode

  • Internationalized Domain Names (IDNs)

Internationalized Domain Names (IDNs)

  • Issues and challenges related to Arabic IDNs

Issues and challenges related to Arabic IDNs

  • Sample (tentative solution) for Urdu language

Sample (tentative solution) for Urdu language

slide-16
SLIDE 16

Domain Name System Domain Name System (DNS) (DNS)

slide-17
SLIDE 17

www.crulp.org 17

Domain Name System (DNS) Domain Name System (DNS)

  • Domain name is the address of a website which

Domain name is the address of a website which is used to access it is used to access it e.g. e.g. www.crulp.org www.crulp.org

slide-18
SLIDE 18

www.crulp.org 18

Domain Name System (DNS) Domain Name System (DNS)

www.crulp.org ISP

  • 1. www.crulp.org

Domain Name Server

  • 3. 192.168.0.1
  • 2. www.crulp.org

Host Server

  • 4. 192.168.0.1
  • 5. Requested Found / Not Found
  • 6. Request Reply

192.168.0.1 www.crulp.org =

slide-19
SLIDE 19

www.crulp.org 19

Need of IDNs Need of IDNs

  • Domain name system (DNS) is in ASCII, i.e.

Domain name system (DNS) is in ASCII, i.e. Latin script Latin script

  • Makes it difficult to access internet for people

Makes it difficult to access internet for people who do not understand English or Latin script who do not understand English or Latin script

slide-20
SLIDE 20

www.crulp.org 20

IDNs IDNs

  • Basic reason for that is the internet addresses

Basic reason for that is the internet addresses map into 7 map into 7-

  • bit ASCII standard

bit ASCII standard

  • We can not change the overall existing system.

We can not change the overall existing system.

  • The solution is to add layer that works on top of

The solution is to add layer that works on top of existing system existing system

  • IDN is any domain name consisting of labels

IDN is any domain name consisting of labels which can be converted to ASCII format which can be converted to ASCII format

  • Initial set of protocols defined in 2003

Initial set of protocols defined in 2003

slide-21
SLIDE 21

www.crulp.org 21

IDNs IDNs

  • A layer that takes the address in local languages

A layer that takes the address in local languages and converts that into ASCII format and converts that into ASCII format

  • DNS continues to resolve ASCII format

DNS continues to resolve ASCII format addresses addresses

  • IDNs may be resolved at the User

IDNs may be resolved at the User’ ’s computer s computer

slide-22
SLIDE 22

www.crulp.org 22

International Domain Name in International Domain Name in Applications (IDNA) Applications (IDNA)

ISP Domain Name Server

  • 4. 192.16.0.1

Host Server

  • 5. 192.16.0.1
  • 6. Requested Found / Not Found
  • 7. Request Reply

www.ودراaتاa.com

  • 1. Convert to ASCII Compatible Encoding

http://www.xn--mgbahbnpifd6na4a4c58gep.com/

slide-23
SLIDE 23

www.crulp.org 23

IDNA 200X IDNA 200X

  • Some Issues observed in the original IDNA2003

Some Issues observed in the original IDNA2003 protocol protocol

  • Dependence on Unicode

Dependence on Unicode ver

  • ver. 3.2

. 3.2

  • Hardcoded

Hardcoded language specific separators language specific separators

  • Decision to revise the original standard taken in

Decision to revise the original standard taken in 2006 2006

  • New standard, IDNA 200X currently under

New standard, IDNA 200X currently under development development

slide-24
SLIDE 24

www.crulp.org 24

IDNA 200X IDNA 200X

  • Assigns values to all Unicode Character

Assigns values to all Unicode Character Database (UCD) on the basis of Unicode Database (UCD) on the basis of Unicode properties properties

  • VALID (or allowed)

VALID (or allowed)

  • DISALLOWED

DISALLOWED

  • CONTEXTO or CONTEXTJ (depends on the

CONTEXTO or CONTEXTJ (depends on the context) context)

slide-25
SLIDE 25

www.crulp.org 25

Morning Session Morning Session

  • Background: Unicode

Background: Unicode

  • Internationalized Domain Names (IDNs)

Internationalized Domain Names (IDNs)

  • Arabic IDNs

Arabic IDNs

  • Sample (tentative) solution for Urdu language

Sample (tentative) solution for Urdu language

slide-26
SLIDE 26

www.crulp.org 26

Arabic Script Arabic Script

  • Arabic script is the second largest script after

Arabic script is the second largest script after Latin script. Latin script.

  • It is used for writing Arabic, Urdu, Persian,

It is used for writing Arabic, Urdu, Persian, Baluchi, Pashto, Sindhi and many other Baluchi, Pashto, Sindhi and many other languages across Pakistan and the world languages across Pakistan and the world

slide-27
SLIDE 27

www.crulp.org 27

Arabic Script Arabic Script

  • Arabic script is defined from:

Arabic script is defined from:

  • U+0600 to U+06FF

U+0600 to U+06FF

  • U+0750 to U+077F

U+0750 to U+077F

  • U+FB50 to U+FDFF

U+FB50 to U+FDFF (Obsolete presentation forms)

(Obsolete presentation forms)

  • U+FE70 to U+FEFF

U+FE70 to U+FEFF (Obsolete presentation forms except

(Obsolete presentation forms except U+FDFx U+FDFx sequence) sequence)

slide-28
SLIDE 28

www.crulp.org 28

Arabic Script Arabic Script

  • Cursive script

Cursive script

  • Shape of each letter may have four different shapes

Shape of each letter may have four different shapes depending on its position (isolated, initial, medial or depending on its position (isolated, initial, medial or final) final)

  • Written from right to left

Written from right to left

  • But Numerals written left to right

But Numerals written left to right

slide-29
SLIDE 29

www.crulp.org 29

Arabic Script Arabic Script

  • Diacritics (optionally) used for vowels

Diacritics (optionally) used for vowels

  • Stretched shapes used for text justification

Stretched shapes used for text justification

  • Shapes of letters highly context sensitive

Shapes of letters highly context sensitive

slide-30
SLIDE 30

www.crulp.org 30

Positional Shapes of Different Letters Positional Shapes of Different Letters

  • NA

NA NA NAوو

ااااچچ اااابب

Final Final Medial Medial Initial Initial Isolated Isolated

slide-31
SLIDE 31

www.crulp.org 31

Issues in Arabic Script Encoding Issues in Arabic Script Encoding

  • Similar character shapes across the languages that

Similar character shapes across the languages that creates confusions e.g. Urdu character creates confusions e.g. Urdu character ىى and Pashto and Pashto character character ىى have similar shapes have similar shapes

  • These different shapes are used as distinct letters in

These different shapes are used as distinct letters in different languages. In Sindhi these are two different different languages. In Sindhi these are two different characters: characters: ڪڪ(U+06AA), and (U+06AA), and کک(U+06A9) (U+06A9)

slide-32
SLIDE 32

www.crulp.org 32

Confusable Variants of Different Confusable Variants of Different Characters Characters

ARABIC LETTER ARABIC LETTER SWASH KAF SWASH KAF (used for Sindhi) (used for Sindhi)ببڪڪببڪڪ

اا ڪڪااڪڪ

U+06AA U+06AA ARABIC LETTER ARABIC LETTER KEHEH KEHEH (used for Persian (used for Persian and Urdu) and Urdu)

ﺑﺑ ﮏ ﮏﺑﺑﮑﮑﺎﺎﮐﮐﺎﺎ ک ک

U+06A9 U+06A9 ARABIC LETTER ARABIC LETTER KAF KAFﺑﺑﻚﻚﺑﺑﻜﻜﺎﺎآآﺎﺎكك U+0643 U+0643

Remarks Remarks

Final Final form form Medial Medial form form Initial Initial form form Isolate Isolate d form d form Unicode Unicode

slide-33
SLIDE 33

www.crulp.org 33

Confusable Variants of Different Confusable Variants of Different Characters Characters

ARABIC ARABIC LETTER ALEF LETTER ALEF MAKSURA MAKSURA

ى ىبب ى ى

U+0649 U+0649 ARABIC ARABIC LETTER FARSI LETTER FARSI YEH YEH (Arabic, Persian, (Arabic, Persian, Urdu) Urdu)

ﻠﻳﻠﻳﯽﯽﻣﻣﻴﻴﻞﻞﻳﻳﻊﻊ ی ی

U+06CC U+06CC ARABIC ARABIC LETTER YEH LETTER YEHﻠﻴﺑﻠﻴﺑ ﻲ ﻲﻣﻣﻴﻴﻞﻞﻳﻳ ﻊ ﻊ ي ي U+064A U+064A

Remarks Remarks

Final Final form form Medial Medial form form Initial Initial form form Isolated Isolated form form Unicode Unicode

slide-34
SLIDE 34

www.crulp.org 34

Optional Diacritics Optional Diacritics

  • Words normally written without diacritics, e.g. in

Words normally written without diacritics, e.g. in Urdu: Urdu:

  • َ

َ ِ ِ / /t t̪̪æ ær r/ (swim) / (swim) / /t t̪̪ir ir/ (arrow) / (arrow)

slide-35
SLIDE 35

www.crulp.org 35

Space Space

  • No concept of space between words in Urdu

No concept of space between words in Urdu

  • Need a separator character

Need a separator character

  • words may assume wrong shapes without a separator

words may assume wrong shapes without a separator e.g. e.g. ند سدند سد will be displayed erroneously will be displayed erroneously نﺪﺳدنﺪﺳد without separator without separator

  • One solution is Zero With Non Joiner (ZWNJ)

One solution is Zero With Non Joiner (ZWNJ) but users unfamiliar with it. but users unfamiliar with it.

slide-36
SLIDE 36

www.crulp.org 36

Bidirectionality Bidirectionality

  • aچراaداaaودراaتاa

aچراaداaaودراaتاa 2001 2001 aa aa رر a۔a a۔a

slide-37
SLIDE 37

www.crulp.org 37

Normalization Normalization

  • There are characters that can be typed in more

There are characters that can be typed in more than one way, e.g. than one way, e.g. U+0622 ( U+0622 ( ﺁﺁ))= = U+0627 ( U+0627 ( اا) + U+0653 ) + U+0653

  • We have to normalize these characters

We have to normalize these characters

slide-38
SLIDE 38

www.crulp.org 38

Normalization Normalization

U+0627 ( U+0627 ( ا ا) + U+0674 ) + U+0674 U+0675 ( U+0675 (ٵٵ)) U+064A ( U+064A (يي) + U+0654 ) + U+0654 U+0626 ( U+0626 (ئئ)) U+0627 ( U+0627 ( ا ا) + U+0655 ) + U+0655 U+0625 ( U+0625 ( إ إ)) U+0648 ( U+0648 ( و و) + U+0653 ) + U+0653 U+0624 ( U+0624 ( ؤ ؤ)) U+0627 ( U+0627 ( ا ا) + U+0654 ) + U+0654 U+0623 ( U+0623 ( أ أ)) U+0627 ( U+0627 ( ا ا) + U+0653 ) + U+0653 U+0622 ( U+0622 ( ﺁ ﺁ))

Decomposed Form Decomposed Form Composed Form Composed Form

slide-39
SLIDE 39

www.crulp.org 39

Normalization Normalization

U+06D2 ( U+06D2 (ےے) + U+0654 ) + U+0654 U+06D3 ( U+06D3 (ۓۓ)) U+06C1 ( U+06C1 (ﮦﮦ) + U+0654 ) + U+0654 U+06C2 ( U+06C2 (ۂۂ)) U+06D5 ( U+06D5 (ەە) + U+0654 ) + U+0654 U+06C0 ( U+06C0 (ۀۀ)) U+064A ( U+064A (يي) +U+0674 ) +U+0674 U+0678 ( U+0678 (ٸٸ) ) U+06C7 ( U+06C7 (وو) + U+0674 ) + U+0674 U+0677 ( U+0677 (ٶٶ) ) U+0648 ( U+0648 (وو) + U+0674 ) + U+0674 U+0676 ( U+0676 (ٶٶ) )

Decomposed Form Decomposed Form Composed Form Composed Form

slide-40
SLIDE 40

www.crulp.org 40

Confusable Characters Confusable Characters

ﻩﻩ

(U+06D5) (U+06D5)

ﮦﮦ

(U+06C1) (U+06C1)

ەە

(U+0647) (U+0647)

ۍۍ

(U+06CD) (U+06CD)

ىى

(U+0649) (U+0649)

یی

(U+06CC) (U+06CC)

ک ک

(U+06AA) (U+06AA)

ک ک

(U+06A9) (U+06A9)

كك

(U+0643) (U+0643)

slide-41
SLIDE 41

www.crulp.org 41

Confusable Characters Confusable Characters… …

ٳٳ(U+0673)

(U+0673)ٳٳ(U+0625) (U+0625)

ٲٲ(U+0672)

(U+0672)

أ أ(U+0623)

(U+0623)

٩ ٩ (U+06F9)

(U+06F9)

٩ ٩ (U+0669)

(U+0669)

٨ ٨ (U+06F8)

(U+06F8)

٨ ٨ (U+0668)

(U+0668)

٧ ٧ (U+06F7)

(U+06F7)

٧ ٧ (U+0667)

(U+0667)

۵ ۵ (U+06F5)

(U+06F5)

٥ ٥ (U+0665)

(U+0665)

٣ ٣ (U+06F3)

(U+06F3)

٣ ٣ (U+0663)

(U+0663)

٢ ٢ (U+06F2)

(U+06F2)

٢ ٢ (U+0662)

(U+0662)

١ ١ (U+06F1)

(U+06F1)

١ ١ (U+0661)

(U+0661)

٠ ٠(U+06F0)

(U+06F0)

٠ ٠(U+0660)

(U+0660)

slide-42
SLIDE 42

www.crulp.org 42

Morning Session Morning Session

  • Background: Unicode

Background: Unicode

  • Internationalized Domain Names (IDNs)

Internationalized Domain Names (IDNs)

  • Arabic IDNs

Arabic IDNs

  • Sample (tentative) solution for Urdu language

Sample (tentative) solution for Urdu language

slide-43
SLIDE 43

www.crulp.org 43

Urdu IDNs Urdu IDNs

  • The following are CONTEXTO by IDNA200X

The following are CONTEXTO by IDNA200X but are not recommended for Urdu but are not recommended for Urdu

slide-44
SLIDE 44

www.crulp.org 44

NO NO

CONTEXTO CONTEXTO

END OF AYAH END OF AYAH

06DD 06DD

۝ ۝

NO NO

CONTEXTO CONTEXTO

ARABIC SIGN ARABIC SIGN SAFHA SAFHA

0603 0603

؃ ؃

NO NO

CONTEXTO CONTEXTO

ARABIC ARABIC FOOTNOTE FOOTNOTE MARKER MARKER

0602 0602

؂؂

NO NO

CONTEXTO CONTEXTO

ARABIC SIGN ARABIC SIGN SANAH SANAH

0601 0601

؁ ؁

NO NO

CONTEXTO CONTEXTO

ARABIC NUMBER ARABIC NUMBER SIGN SIGN

0600 0600

؀ ؀

Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode Character Character

slide-45
SLIDE 45

www.crulp.org 45

Urdu IDNs Urdu IDNs

  • The following are PVALID by IDNA200X but

The following are PVALID by IDNA200X but these are not recommended for Urdu these are not recommended for Urdu

slide-46
SLIDE 46

www.crulp.org 46

NO NO

PVALID PVALID

ARABIC VOWEL ARABIC VOWEL SIGN SMALL V SIGN SMALL V ABOVE ABOVE

065A 065A NO NO

PVALID PVALID

ARABIC ARABIC ZWARAKAY ZWARAKAY

0659 0659 NO NO

PVALID PVALID

ARABIC ARABIC INVERTED INVERTED DAMMA DAMMA

0657 0657

ٗٗ

NO NO

PVALID PVALID

ARABIC ARABIC TATWEEL TATWEEL

0640 0640ــ NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL HIGH TAH HIGH TAH

0615 0615

LL

Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode Character Character

slide-47
SLIDE 47

www.crulp.org 47

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER ALEF WASLA ALEF WASLA

0671 0671ٱٱ NO NO

PVALID PVALID

ARABIC FATHA ARABIC FATHA WITH TWO DOTS WITH TWO DOTS

065E 065E NO NO

PVALID PVALID

ARABIC REVERSED ARABIC REVERSED DAMMA DAMMA

065D 065D NO NO

PVALID PVALID

ARABIC VOWEL ARABIC VOWEL SIGN DOT BELOW SIGN DOT BELOW

065C 065C NO NO

PVALID PVALID

ARABIC VOWEL ARABIC VOWEL SIGN INVERTED SIGN INVERTED SMALL V ABOVE SMALL V ABOVE

065B 065B Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-48
SLIDE 48

www.crulp.org 48

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER BEEH BEEH

067B 067Bٻٻ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER TTEHEH TTEHEH

067A 067Aٺٺ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER HIGH HAMZA HIGH HAMZA

0674 0674

ٔ ٔ

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER ALEF WITH WAVY ALEF WITH WAVY HAMZA BELOW HAMZA BELOW

0673 0673ٳٳ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER ALEF WITH WAVY ALEF WITH WAVY HAMZA ABOVE HAMZA ABOVE

0672 0672ٲٲ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

ٲ

slide-49
SLIDE 49

www.crulp.org 49

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER HAH WITH HAMZA HAH WITH HAMZA ABOVE ABOVE

0681 0681ځځ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER BEHEH BEHEH

0680 0680ڀڀ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER TEHEH TEHEH

067F 067Fٿٿ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER TEH WITH THREE TEH WITH THREE DOTS ABOVE DOTS ABOVE DOWN DOWN

067D 067Dٽٽ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER TEH WITH RING TEH WITH RING

067C 067Cټټ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-50
SLIDE 50

www.crulp.org 50

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER HAH WITH TWO HAH WITH TWO DOTS VERTICAL DOTS VERTICAL ABO ABO

0682 0682ڂڂ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER TCHEHEH TCHEHEH

0687 0687ڇڇ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER HAH WITH THREE HAH WITH THREE DOTS ABOVE DOTS ABOVE

0685 0685څڅ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER DYEH DYEH

0684 0684ڄڄ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER NYEH NYEH

0683 0683ڃڃ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-51
SLIDE 51

www.crulp.org 51

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER DDAHAL DDAHAL

068D 068Dﮂﮂ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER DAHAL DAHAL

068C 068Cﮄﮄ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER DAL WITH DOT DAL WITH DOT BELOW AND SMALL BELOW AND SMALL T T

068B 068Bڋڋ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER DAL WITH DOT DAL WITH DOT BELOW BELOW

068A 068Aڊڊ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER DAL WITH RING DAL WITH RING

0689 0689ډډ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode Character Character

slide-52
SLIDE 52

www.crulp.org 52

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER REH WITH RING REH WITH RING

0693 0693ړړ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER REH WITH SMALL V REH WITH SMALL V

0692 0692ڒڒ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER DAL WITH FOUR DAL WITH FOUR DOTS ABOVE DOTS ABOVE

0690 0690ڐڐ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER DAL WITH THREE DAL WITH THREE DOTS ABOVE DOTS ABOVE DOWN DOWN

068F 068Fڏڏ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER DUL DUL

068E 068E ﮆﮆ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-53
SLIDE 53

www.crulp.org 53

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER REH WITH FOUR REH WITH FOUR DOTS ABOVE DOTS ABOVE

0699 0699ڙڙ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER REH WITH TWO REH WITH TWO DOTS ABOVE DOTS ABOVE

0697 0697ڗڗ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER REH WITH DOT REH WITH DOT BELOW AND DOT BELOW AND DOT ABO ABO

0696 0696ږږ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER REH WITH SMALL V REH WITH SMALL V BELOW BELOW

0695 0695ڕڕ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER REH WITH DOT REH WITH DOT BELOW BELOW

0694 0694ڔڔ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-54
SLIDE 54

www.crulp.org 54

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER SAD WITH THREE SAD WITH THREE DOTS ABOVE DOTS ABOVE

069E 069Eڞڞ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER SAD WITH TWO SAD WITH TWO DOTS BELOW DOTS BELOW

069D 069Dڝڝ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER SEEN WITH THREE SEEN WITH THREE DOTS BELOW AND DOTS BELOW AND

069C 069Cڜڜ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER SEEN WITH THREE SEEN WITH THREE DOTS BELOW DOTS BELOW

069B 069Bڛڛ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER SEEN WITH DOT SEEN WITH DOT BELOW AND DOT BELOW AND DOT AB AB

069A 069Aښښ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-55
SLIDE 55

www.crulp.org 55

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER FEH WITH DOT FEH WITH DOT MOVED BELOW MOVED BELOW

06A2 06A2ڢڢ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER DOTLESS FEH DOTLESS FEH

06A1 06A1ڡڡ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER AIN WITH THREE AIN WITH THREE DOTS ABOVE DOTS ABOVE

06A0 06A0ڠڠ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER TAH WITH THREE TAH WITH THREE DOTS ABOVE DOTS ABOVE

069F 069Fڟڟ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-56
SLIDE 56

www.crulp.org 56

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER QAF WITH DOT QAF WITH DOT ABOVE ABOVE

06A7 06A7ڧڧ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER PEHEH PEHEH

06A6 06A6ڦڦ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER FEH WITH THREE FEH WITH THREE DOTS BELOW DOTS BELOW

06A5 06A5ڥڥ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER VEH VEH

06A4 06A4ڤڤ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER FEH WITH DOT FEH WITH DOT BELOW BELOW

06A3 06A3 ڣڣ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode Character Character

slide-57
SLIDE 57

www.crulp.org 57

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER NG NG

06AD 06ADڭڭ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER KAF WITH DOT KAF WITH DOT ABOVE ABOVE

06AC 06ACګګ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER KAF WITH RING KAF WITH RING

06AB 06ABګګ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER SWASH KAF SWASH KAF

06AA 06AAڪڪ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER QAF WITH THREE QAF WITH THREE DOTS ABOVE DOTS ABOVE

06A8 06A8 ڨڨ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-58
SLIDE 58

www.crulp.org 58

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER GAF WITH TWO GAF WITH TWO DOTS BELOW DOTS BELOW

06B2 06B2ڲڲ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER NGOEH NGOEH

06B1 06B1ڱڱ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER GAF WITH RING GAF WITH RING

06B0 06B0ڰڰ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER KAF WITH THREE KAF WITH THREE DOTS BELOW DOTS BELOW

06AE 06AEڮڮ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-59
SLIDE 59

www.crulp.org 59

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER LAM WITH THREE LAM WITH THREE DOTS ABOVE DOTS ABOVE

06B7 06B7ڷڷ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER LAM WITH DOT LAM WITH DOT ABOVE ABOVE

06B6 06B6ڶڶ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER LAM WITH SMALL V LAM WITH SMALL V

06B5 06B5ڵڵ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER GAF WITH THREE GAF WITH THREE DOTS ABOVE DOTS ABOVE

06B4 06B4ڴڴ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER GUEH GUEH

06B3 06B3ڳڳ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-60
SLIDE 60

www.crulp.org 60

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER NOON WITH RING NOON WITH RING

06BC 06BCڼڼ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER RNOON RNOON

06BB 06BBڻڻ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER NOON WITH DOT NOON WITH DOT BELOW BELOW

06B9 06B9ڹڹ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER LAM WITH THREE LAM WITH THREE DOTS BELOW DOTS BELOW

06B8 06B8ڸڸ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-61
SLIDE 61

www.crulp.org 61

NO NO

PVALID PVALID

ARABIC LETTER OE ARABIC LETTER OE

06C6 06C6ۆۆ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER KIRGHIZ OE KIRGHIZ OE

06C5 06C5ۅۅ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER WAW WITH RING WAW WITH RING

06C4 06C4ۄۄ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER TCHEH WITH DOT TCHEH WITH DOT ABOVE ABOVE

06BF 06BFڿڿ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER NOON WITH NOON WITH THREE DOTS THREE DOTS ABOVE ABOVE

06BD 06BD ڽڽ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode Character Character

slide-62
SLIDE 62

www.crulp.org 62

NO NO

PVALID PVALID

ARABIC LETTER VE ARABIC LETTER VE

06CB 06CBۋۋ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER WAW WITH TWO WAW WITH TWO DOTS ABOVE DOTS ABOVE

06CA 06CAۊۊ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER KIRGHIZ YU KIRGHIZ YU

06C9 06C9ۉۉ NO NO

PVALID PVALID

ARABIC LETTER YU ARABIC LETTER YU

06C8 06C8ۈۈ NO NO

PVALID PVALID

ARABIC LETTER U ARABIC LETTER U

06C7 06C7 ۇۇ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-63
SLIDE 63

www.crulp.org 63

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER YEH WITH THREE YEH WITH THREE DOTS BELOW DOTS BELOW

06D1 06D1ۑۑ NO NO

PVALID PVALID

ARABIC LETTER E ARABIC LETTER E

06D0 06D0ېې NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER WAW WITH DOT WAW WITH DOT ABOVE ABOVE

06CF 06CFۏۏ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER YEH WITH SMALL V YEH WITH SMALL V

06CE 06CEێێ NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER YEH WITH TAIL YEH WITH TAIL

06CD 06CDۍۍ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-64
SLIDE 64

www.crulp.org 64

NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL HIGH LAM ALEF HIGH LAM ALEF

06D9 06D9 NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL HIGH MEEM HIGH MEEM INITIAL FORM INITIAL FORM

06D8 06D8 NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL HIGH LIGATURE HIGH LIGATURE QAF WITH LAM QAF WITH LAM WITH WITH

06D7 06D7 NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL HIGH LIGATURE HIGH LIGATURE SAD WITH LAM SAD WITH LAM WITH WITH

06D6 06D6 NO NO

PVALID PVALID

ARABIC LETTER AE ARABIC LETTER AE

06D5 06D5ەە Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-65
SLIDE 65

www.crulp.org 65

NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL HIGH ROUNDED HIGH ROUNDED ZERO ZERO

06DF 06DF NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL HIGH SEEN HIGH SEEN

06DC 06DC NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL HIGH THREE DOTS HIGH THREE DOTS

06DB 06DB NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL HIGH JEEM HIGH JEEM

06DA 06DA Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-66
SLIDE 66

www.crulp.org 66

NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL HIGH MADDA HIGH MADDA

06E4 06E4 NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL LOW SEEN LOW SEEN

06E3 06E3 NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL HIGH MEEM HIGH MEEM ISOLATED FORM ISOLATED FORM

06E2 06E2 NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL HIGH DOTLESS HIGH DOTLESS HEAD OF KHAH HEAD OF KHAH

06E1 06E1 NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL HIGH UPRIGHT HIGH UPRIGHT RECTANGULAR RECTANGULAR ZERO ZERO

06E0 06E0 Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode Character Character

slide-67
SLIDE 67

www.crulp.org 67

NO NO

PVALID PVALID

ARABIC EMPTY ARABIC EMPTY CENTRE LOW STOP CENTRE LOW STOP

06EA 06EA NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL HIGH NOON HIGH NOON

06E8 06E8 NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL HIGH YEH HIGH YEH

06E7 06E7 NO NO

PVALID PVALID

ARABIC SMALL YEH ARABIC SMALL YEH

06E6 06E6 NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL WAW WAW

06E5 06E5 Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-68
SLIDE 68

www.crulp.org 68

NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER REH WITH REH WITH

06EF 06EF NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER HEH WITH HEH WITH INVERTED V INVERTED V

06FF 06FF NO NO

PVALID PVALID

ARABIC LETTER ARABIC LETTER DAL WITH DAL WITH INVERTED V INVERTED V

06EE 06EE NO NO

PVALID PVALID

ARABIC SMALL ARABIC SMALL LOW MEEM LOW MEEM

06ED 06ED NO NO

PVALID PVALID

ARABIC ROUNDED ARABIC ROUNDED HIGH STOP WITH HIGH STOP WITH FILLED CENTRE FILLED CENTRE

06EC 06EC NO NO

PVALID PVALID

ARABIC EMPTY ARABIC EMPTY CENTRE HIGH CENTRE HIGH STOP STOP

06EB 06EB Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-69
SLIDE 69

www.crulp.org 69

Urdu Language Table For IDNs Urdu Language Table For IDNs

  • Following characters are DISALLOWED by

Following characters are DISALLOWED by IDNA 200X and are not required IDNA 200X and are not required

slide-70
SLIDE 70

www.crulp.org 70

N0 N0

DISALLOWED DISALLOWED

ARABIC SIGN MISRA ARABIC SIGN MISRA

060F 060F

N0 N0

DISALLOWED DISALLOWED

ARABIC POETIC ARABIC POETIC VERSE SIGN VERSE SIGN

060E 060E

N0 N0

DISALLOWED DISALLOWED

ARABIC DATE ARABIC DATE SEPARATOR SEPARATOR

060D 060D

N0 N0

DISALLOWED DISALLOWED

ARABIC COMMA ARABIC COMMA

060C 060C

N0 N0

DISALLOWED DISALLOWED

AFGHANI SIGN AFGHANI SIGN

060B 060B

Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode Character Character

slide-71
SLIDE 71

www.crulp.org 71

N0 N0

DISALLOWED DISALLOWED

ARABIC DECIMAL ARABIC DECIMAL SEPARATOR SEPARATOR

066B 066B N0 N0

DISALLOWED DISALLOWED

ARABIC PERCENT ARABIC PERCENT SIGN SIGN

066A 066A N0 N0

DISALLOWED DISALLOWED

ARABIC QUESTION ARABIC QUESTION MARK MARK

061F 061F N0 N0

DISALLOWED DISALLOWED

ARABIC TRIPLE ARABIC TRIPLE DOT DOT PUNCTUATION PUNCTUATION MARK MARK

061E 061E N0 N0

DISALLOWED DISALLOWED

ARABIC ARABIC SEMICOLON SEMICOLON

061B 061B Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-72
SLIDE 72

www.crulp.org 72

N0 N0

DISALLOWED DISALLOWED

ARABIC LETTER U ARABIC LETTER U WITH HAMZA WITH HAMZA ABOVE ABOVE

0677 0677 N0 N0

DISALLOWED DISALLOWED

ARABIC LETTER ARABIC LETTER HIGH HAMZA YEH HIGH HAMZA YEH

0678 0678 N0 N0

DISALLOWED DISALLOWED

ARABIC LETTER ARABIC LETTER HIGH HAMZA WAW HIGH HAMZA WAW

0676 0676 N0 N0

DISALLOWED DISALLOWED

ARABIC LETTER ARABIC LETTER HIGH HAMZA ALEF HIGH HAMZA ALEF

0675 0675 N0 N0

DISALLOWED DISALLOWED

ARABIC FIVE ARABIC FIVE POINTED STAR POINTED STAR

066D 066D N0 N0

DISALLOWED DISALLOWED

ARABIC ARABIC THOUSANDS THOUSANDS SEPARATOR SEPARATOR

066C 066C Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

ٲ

slide-73
SLIDE 73

www.crulp.org 73

N0 N0

DISALLOWED DISALLOWED

ARABIC LETTER ARABIC LETTER HAH WITH HAMZA HAH WITH HAMZA ABOVE ABOVE

06D4 06D4 N0 N0

DISALLOWED DISALLOWED

ARABIC START OF ARABIC START OF RUB EL HIZB RUB EL HIZB

06DE 06DE N0 N0

DISALLOWED DISALLOWED

ARABIC LIGATURE ARABIC LIGATURE ALLAH ISOLATED ALLAH ISOLATED FORM FORM

FDF2 FDF2ﷲﷲ N0 N0

DISALLOWED DISALLOWED

ARABIC SIGN ARABIC SIGN SINDHI SINDHI AMPERSAND AMPERSAND

06FD 06FD N0 N0

DISALLOWED DISALLOWED

ARABIC PLACE OF ARABIC PLACE OF SAJDAH SAJDAH

06E9 06E9 Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-74
SLIDE 74

www.crulp.org 74

N0 N0

DISALLOWED DISALLOWED

ARABIC LIGATURE ARABIC LIGATURE ALAYHE ISOLATED ALAYHE ISOLATED FORM FORM

FDF7 FDF7ﷷﷷ N0 N0

DISALLOWED DISALLOWED

ARABIC LIGATURE ARABIC LIGATURE RASOUL ISOLATED RASOUL ISOLATED FORM FORM

FDF6 FDF6ﷶﷶ N0 N0

DISALLOWED DISALLOWED

ARABIC LIGATURE ARABIC LIGATURE SALAM ISOLATED SALAM ISOLATED FORM FORM

FDF5 FDF5 N0 N0

DISALLOWED DISALLOWED

ARABIC LIGATURE ARABIC LIGATURE MOHAMMAD MOHAMMAD ISOLATED FORM ISOLATED FORM

FDF4 FDF4ﷴﷴ N0 N0

DISALLOWED DISALLOWED

ARABIC LIGATURE ARABIC LIGATURE AKBAR ISOLATED AKBAR ISOLATED FORM FORM

FDF3 FDF3ﷳﷳ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode Character Character

slide-75
SLIDE 75

www.crulp.org 75

N0 N0

DISALLOWED DISALLOWED

ARABIC LIGATURE ARABIC LIGATURE JALLAJALALOUHOU JALLAJALALOUHOU

FDFB FDFB

¡ ¡

N0 N0

DISALLOWED DISALLOWED

ARABIC LIGATURE ARABIC LIGATURE SALLALLAHOU SALLALLAHOU ALAYHE ALAYHE WASALLAM WASALLAM

FDFA FDFA

ﷺﷺ

N0 N0

DISALLOWED DISALLOWED

ARABIC LIGATURE ARABIC LIGATURE SALLA ISOLATED SALLA ISOLATED FORM FORM

FDF9 FDF9

ﷹﷹ

N0 N0

DISALLOWED DISALLOWED

ARABIC LIGATURE ARABIC LIGATURE WASALLAM WASALLAM ISOLATED FORM ISOLATED FORM

FDF8 FDF8

ﷸﷸ

Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-76
SLIDE 76

www.crulp.org 76

Urdu IDNs Urdu IDNs

  • Following are PVALID characters by

Following are PVALID characters by IDNA200X decision and are required for Urdu IDNA200X decision and are required for Urdu

slide-77
SLIDE 77

www.crulp.org 77

YES YES PVALID PVALID

ARABIC SIGN ARABIC SIGN TAKHALLUS TAKHALLUS

0614 0614

YES YES PVALID PVALID

ARABIC SIGN RADI ARABIC SIGN RADI ALLAHOU ANHU ALLAHOU ANHU

0613 0613

YES YES PVALID PVALID

ARABIC SIGN ARABIC SIGN RAHMATULLAH RAHMATULLAH ALAYHE ALAYHE

0612 0612

YES YES PVALID PVALID

ARABIC SIGN ARABIC SIGN ALAYHE ASSALLAM ALAYHE ASSALLAM

0611 0611

YES YES PVALID PVALID

ARABIC SIGN ARABIC SIGN SALLALLAHOU SALLALLAHOU ALAYHE ALAYHE WASSALLAM WASSALLAM

0610 0610

Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode Character Character

slide-78
SLIDE 78

www.crulp.org 78

YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC LETTER ARABIC LETTER ALEF WITH HAMZA ALEF WITH HAMZA BELOW BELOW

0625 0625إإ YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER WAW WITH HAMZA WAW WITH HAMZA ABOVE ABOVE

0624 0624ؤؤ YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER ALEF WITH HAMZA ALEF WITH HAMZA ABOVE ABOVE

0623 0623أأ YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER ALEF WITH MADDA ALEF WITH MADDA ABOVE ABOVE

0622 0622ﺁﺁ YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER HAMZA HAMZA

0621 0621 ءء Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-79
SLIDE 79

www.crulp.org 79

YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER TEH TEH

062A 062Aتت YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER THEH THEH

062B 062Bثث YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC LETTER ARABIC LETTER TEH MARBUTA TEH MARBUTA

0629 0629ةة YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER BEH BEH

0628 0628بب YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER ALEF ALEF

0627 0627اا YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC LETTER ARABIC LETTER YEH WITH HAMZA YEH WITH HAMZA ABOVE ABOVE

0626 0626ئئ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

ٲ

slide-80
SLIDE 80

www.crulp.org 80

YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER JEEM JEEM

062C 062Cجج YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER HAH HAH

062D 062Dحح YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER THAL THAL

0630 0630ذذ YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER DAL DAL

062F 062Fدد YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER KHAH KHAH

062E 062Eخخ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-81
SLIDE 81

www.crulp.org 81

YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER SAD SAD

0635 0635صص YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER SHEEN SHEEN

0634 0634شش YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER SEEN SEEN

0633 0633سس YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER ZAIN ZAIN

0632 0632زز YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER REH REH

0631 0631رر Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode Character Character

slide-82
SLIDE 82

www.crulp.org 82

YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER ZAH ZAH

0638 0638ظظ YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER AIN AIN

0639 0639عع YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER TAH TAH

0637 0637طط YES YES PVALID PVALID

ARABIC LETTER ARABIC LETTER DAD DAD

0636 0636ضض Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-83
SLIDE 83

www.crulp.org 83

YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC LETTER ARABIC LETTER KAF KAF

0643 0643كك YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER QAF QAF

0642 0642قق YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER FEH FEH

0641 0641

ف ف

YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER GHAIN GHAIN

063A 063Aغغ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-84
SLIDE 84

www.crulp.org 84

YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER WAW WAW

0648 0648وو YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC LETTER ARABIC LETTER HEH HEH

0647 0647ﻩﻩ YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER NOON NOON

0646 0646نن YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER MEEM MEEM

0645 0645مم YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER LAM LAM

0644 0644لل Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-85
SLIDE 85

www.crulp.org 85

YES YES

PVALID PVALID

ARABIC ARABIC DAMMATAN DAMMATAN

064C 064C YES YES

PVALID PVALID

ARABIC KASRATAN ARABIC KASRATAN

064D 064D YES YES

PVALID PVALID

ARABIC FATHATAN ARABIC FATHATAN

064B 064B YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC LETTER ARABIC LETTER YEH YEH

064A 064Aيي YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC LETTER ARABIC LETTER ALEF MAKSURA ALEF MAKSURA

0649 0649ىى Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-86
SLIDE 86

www.crulp.org 86

YES YES

PVALID PVALID

ARABIC SUKUN ARABIC SUKUN

0652 0652 YES YES

PVALID PVALID

ARABIC SHADDA ARABIC SHADDA

0651 0651 YES YES

PVALID PVALID

ARABIC KASRA ARABIC KASRA

0650 0650 YES YES

PVALID PVALID

ARABIC DAMMA ARABIC DAMMA

064F 064F YES YES

PVALID PVALID

ARABIC FATHA ARABIC FATHA

064E 064E Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode Character Character

slide-87
SLIDE 87

www.crulp.org 87

YES YES

PVALID PVALID

ARABIC MARK ARABIC MARK NOON NOON GHUNNA GHUNNA

0658 0658 YES YES

PVALID PVALID

ARABIC SUBSCRIPT ARABIC SUBSCRIPT ALEF ALEF

0656 0656 YES YES

PVALID PVALID

ARABIC HAMZA ARABIC HAMZA BELOW BELOW

0655 0655 YES YES

PVALID PVALID

ARABIC HAMZA ARABIC HAMZA ABOVE ABOVE

0654 0654 YES YES

PVALID PVALID

ARABIC MADDAH ARABIC MADDAH ABOVE ABOVE

0653 0653 Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-88
SLIDE 88

www.crulp.org 88

YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC ARABIC-

  • INDIC

INDIC DIGIT TWO DIGIT TWO

0662 0662

٢ ٢

YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC ARABIC-

  • INDIC

INDIC DIGIT THREE DIGIT THREE

0663 0663

٣ ٣

YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC ARABIC-

  • INDIC

INDIC DIGIT ONE DIGIT ONE

0661 0661

١ ١

YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC ARABIC-

  • INDIC

INDIC DIGIT ZERO DIGIT ZERO

0660 0660

٠ ٠

Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-89
SLIDE 89

www.crulp.org 89

YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC ARABIC-

  • INDIC

INDIC DIGIT EIGHT DIGIT EIGHT

0668 0668

٨ ٨

YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC ARABIC-

  • INDIC

INDIC DIGIT SEVEN DIGIT SEVEN

0667 0667

٧ ٧

YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC ARABIC-

  • INDIC

INDIC DIGIT SIX DIGIT SIX

0666 0666

٦ ٦

YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC ARABIC-

  • INDIC

INDIC DIGIT FIVE DIGIT FIVE

0665 0665

٥ ٥

YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC ARABIC-

  • INDIC

INDIC DIGIT FOUR DIGIT FOUR

0664 0664

٤ ٤

Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-90
SLIDE 90

www.crulp.org 90

YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER PEH PEH

067E 067Eپپ YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER TCHEH TCHEH

0686 0686چچ YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER DDAL DDAL

0688 0688ڈڈ YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER TTEH TTEH

0679 0679ٹٹ YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER SUPERSCRIPT ALEF SUPERSCRIPT ALEF

0670 0670 YES YES

(Variant of Base (Variant of Base Character) Character)

PVALID PVALID

ARABIC ARABIC-

  • INDIC

INDIC DIGIT NINE DIGIT NINE

0669 0669

٩ ٩

Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-91
SLIDE 91

www.crulp.org 91

YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER NOON GHUNNA NOON GHUNNA

06BA 06BAںں YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER GAF GAF

06AF 06AFگگ YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER KEHEH KEHEH

06A9 06A9کک YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER JEH JEH

0698 0698ژژ YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER RREH RREH

0691 0691 ڑڑ Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode Character Character

slide-92
SLIDE 92

www.crulp.org 92

YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER TEH MARBUTA TEH MARBUTA GOAL GOAL

06C3 06C3ۃۃ YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER HEH GOAL WITH HEH GOAL WITH HAMZA HAMZA ABOVE ABOVE

06C2 06C2ۂۂ YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER HEH GOA HEH GOA

06C1 06C1ﮦﮦ YES YES

(Variant of Base (Variant of Base character) character)

PVALID PVALID

ARABIC LETTER ARABIC LETTER HEH WITH YEH HEH WITH YEH ABOVE ABOVE

06C0 06C0تت YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER HEH HEH DOACHASHMEE DOACHASHMEE

06BE 06BE هه Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-93
SLIDE 93

www.crulp.org 93

YES YES

PVALID PVALID

EXTENDED EXTENDED ARABIC ARABIC-

  • INDIC

INDIC DIGIT ZERO DIGIT ZERO

06F0 06F0

٠ ٠

YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER YEH BARREE WITH YEH BARREE WITH HAMZA ABOVE HAMZA ABOVE

06D3 06D3ۓۓ YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER YEH BARREE YEH BARREE

06D2 06D2ےے YES YES

PVALID PVALID

ARABIC LETTER ARABIC LETTER FARSI YEH FARSI YEH

06CC 06CCیی Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-94
SLIDE 94

www.crulp.org 94

YES YES

PVALID PVALID

EXTENDED EXTENDED ARABIC ARABIC-

  • INDIC

INDIC DIGIT FIVE DIGIT FIVE

06F5 06F5

۵ ۵

YES YES

PVALID PVALID

EXTENDED EXTENDED ARABIC ARABIC-

  • INDIC

INDIC DIGIT FOUR DIGIT FOUR

06F4 06F4

۴ ۴

YES YES

PVALID PVALID

EXTENDED EXTENDED ARABIC ARABIC-

  • INDIC

INDIC DIGIT THREE DIGIT THREE

06F3 06F3

٣ ٣

YES YES

PVALID PVALID

EXTENDED EXTENDED ARABIC ARABIC-

  • INDIC

INDIC DIGIT TWO DIGIT TWO

06F2 06F2

٢ ٢

YES YES

PVALID PVALID

EXTENDED EXTENDED ARABIC ARABIC-

  • INDIC

INDIC DIGIT ONE DIGIT ONE

06F1 06F1

١ ١

Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-95
SLIDE 95

www.crulp.org 95

YES YES

PVALID PVALID

EXTENDED EXTENDED ARABIC ARABIC-

  • INDIC

INDIC DIGIT EIGHT DIGIT EIGHT

06F8 06F8

٨ ٨

YES YES

PVALID PVALID

EXTENDED EXTENDED ARABIC ARABIC-

  • INDIC

INDIC DIGIT NINE DIGIT NINE

06F9 06F9

٩ ٩

YES YES

PVALID PVALID

EXTENDED EXTENDED ARABIC ARABIC-

  • INDIC

INDIC DIGIT SEVEN DIGIT SEVEN

06F7 06F7

٧ ٧

YES YES

PVALID PVALID

EXTENDED EXTENDED ARABIC ARABIC-

  • INDIC

INDIC DIGIT SIX DIGIT SIX

06F6 06F6

۶ ۶

Recommen Recommen-

  • dation

dation Current Current status in status in IDNA 200x IDNA 200x Description Description Unicode Unicode

Character Character

slide-96
SLIDE 96

www.crulp.org 96

Afternoon Session Afternoon Session

  • Sample language tables for the following languages

Sample language tables for the following languages

  • Balochi

Balochi

  • Pashto

Pashto

  • Punjabi

Punjabi

  • Seraiki

Seraiki

  • Sindhi

Sindhi

  • Urdu

Urdu

  • Torwali

Torwali

  • Collective Issues for multiple languages

Collective Issues for multiple languages

slide-97
SLIDE 97

www.crulp.org 97

Collective Issues Collective Issues

  • Separator

Separator

  • Diacritics

Diacritics

  • Honorifics

Honorifics

  • Confusable characters

Confusable characters

  • Kaf

Kaf

  • Yay

Yay

  • Hay

Hay

  • Gol

Gol Tay Tay

  • Others?

Others?

  • Digits

Digits

  • Space/ZWNJ

Space/ZWNJ

  • Blocking vs. Bundling

Blocking vs. Bundling