Velocity Reviews - Computer Hardware Reviews

Velocity Reviews > Newsgroups > Programming > Java > Re: Isn't java.lang.Character.html#{ isLetterFromLang(int codePoint,String ISOLangDef) missing from the spec?

Reply
Thread Tools

Re: Isn't java.lang.Character.html#{ isLetterFromLang(int codePoint,String ISOLangDef) missing from the spec?

 
 
Arne Vajhøj
Guest
Posts: n/a
 
      12-05-2010
On 04-12-2010 19:16, wrote:
> One possibly (and easily ) could based on the Unicode code points
>

check the ranges for each language, but I think it would be very useful
> for people parsing text from different languages.


It is not there.

But I am not sure that it is missing.

The concept will be fundamentally broken if one language
has more than one alphabet (I don't know if such case exist,
but it could).

And the benefits are very limited given the practice
of writing names as they are in their native language
even though the letters are not used in the language
of the text.

Arne
 
Reply With Quote
 
 
 
 
Lew
Guest
Posts: n/a
 
      12-05-2010
wrote:
> One possibly (and easily ) could based on the Unicode code points
>
> check the ranges for each language, but I think it would be very useful
>> for people parsing text from different languages.


Arne Vajhøj wrote:
> It is not there.
>
> But I am not sure that it is missing.
>
> The concept will be fundamentally broken if one language
> has more than one alphabet (I don't know if such case exist,
> but it could).


Japanese has more than one alphabet.

It's also broken if more than one language shares the same alphabet, or parts
thereof, as many European languages do.

> And the benefits are very limited given the practice
> of writing names as they are in their native language
> even though the letters are not used in the language
> of the text.


Also there are changes within a language, often deployed with considerable
élan, such as an added soupçon of borrowed terminology, or abandonment of
symbols from its noösphere, both manœuvres that English has undergone.

--
Lew
 
Reply With Quote
 
 
 
 
Arne Vajhøj
Guest
Posts: n/a
 
      12-05-2010
On 04-12-2010 21:36, Lew wrote:
> wrote:
>> One possibly (and easily ) could based on the Unicode code points
>>
>> check the ranges for each language, but I think it would be very useful
>>> for people parsing text from different languages.

>
> Arne Vajhøj wrote:
>> It is not there.
>>
>> But I am not sure that it is missing.
>>
>> The concept will be fundamentally broken if one language
>> has more than one alphabet (I don't know if such case exist,
>> but it could).

>
> Japanese has more than one alphabet.


That breaks the idea.

> It's also broken if more than one language shares the same alphabet, or
> parts thereof, as many European languages do.


The function would work in this case - it would just return
true for more than one language for a lot of codepoints.

>> And the benefits are very limited given the practice
>> of writing names as they are in their native language
>> even though the letters are not used in the language
>> of the text.

>
> Also there are changes within a language, often deployed with
> considerable élan, such as an added soupçon of borrowed terminology, or
> abandonment of symbols from its noösphere, both manœuvres that English
> has undergone.


Yup.

Arne

 
Reply With Quote
 
 
 
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Crystal Reports - Visual Basic UFL that implements this function is missing (or U2lcom.dll is missing) Les Caudle ASP .Net 3 09-03-2007 02:27 AM
Re: missing feature classes and missing fields Gary Herron Python 2 07-04-2006 10:29 PM
missing wzcdlg.dll =?Utf-8?B?RGFuZGVl?= Wireless Networking 4 11-20-2004 02:14 PM
Missing Wireless Link applet =?Utf-8?B?Q2hyaXNzaWU=?= Wireless Networking 8 10-07-2004 07:24 PM
Missing Websites and no mail =?Utf-8?B?U2hpZnR3b3JrZXI0Mw==?= Wireless Networking 0 09-11-2004 12:31 PM



Advertisments
 



1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57