You are not logged in.

#1 2010-08-06 09:05:30

stephankn
Moderator
Registered: 2010-05-04
Posts: 621

Software for romanization of Thai script

Hi,

I'm looking for a software to do batch romanization of Thai script. As RTGS is considered standard, RTGS is the preferred way.

I know the windows tool from
http://www.arts.chula.ac.th/~ling/tts/

I need it for Linux. Better would be a portable solution, maybe java.

Is there any free library? Alternatively a commercial library that grants a free license for the purposes of OpenStreetMap.

For a bilingual map currently name:en is used.
This could be complemented by romanized form of names.

So instead of Sukhumvit Road it would state Thanon Sukhumvit

I think this could be complemented by an lookup table to not romanize English loan words. Maybe someone know more in detail what RTGS mentiones regarding this.

I saw an English document describing RTGS in very detail but lost it.

Stephan

Offline

#2 2010-08-06 10:01:12

Willi2006
Member
From: Khon Kaen, Thailand
Registered: 2009-12-12
Posts: 421
Website

Re: Software for romanization of Thai script

I would like to repeat my post in the thread Name tags for brand names like "BigC", "TESCO Lotus" or "Shell" dated 2010-05-12 11:39:13.

Willi2006 wrote:
stephankn wrote:

When romanization can be done automatically, the English tag could be used for the English name.

Unfortunately the automation is not trivial. I'm using the program which is also listed on the Wikiprojekt Thailand page Chulalongkorn University: Thai Romanization Program almost one year. It's version 1.4 now. Even when there's a romanization available I like to compare it with the programs romanization. I've experienced these issues:

  • Sometimes the romanization of the program is wrong.
    These cases are rarely but they happen. I always ask a native speaker to look at the error.

  • Sometimes the program has another romanization than on the sign.
    This happens more often and is due to the fact that the rules have changed and even the government does not follow the rules. You can also find different romanization on different signs of the same road.
    I take the programs version very rarely: If it is not the name of a company, brand, government office etc. and the romanization on the sign is definitely wrong. I keep in mind that people using the map will compare with the romanization on the sign and not enter the Thai name from the sign into a program and compare the output.

    Thus if you can't find the romanization on the internet you have to be on the spot.

Offline

#3 2010-08-06 21:19:25

stephankn
Moderator
Registered: 2010-05-04
Posts: 621

Re: Software for romanization of Thai script

Hello Willi2006,

The tool you mention is the same I already know. First it's windows, second it's free only for personal non-commercial use. So not compatible with the OSM license.

I'm also aware of the brand names discussion. That's why I think a combination with a lookup table can help a lot.

Most names should be streets and places. In both cases it's nothing that can be handled with the strategy from the brand name discussion.
I'm very confident that to be useful for Thai people the map has to be in Thai. And then we need a way to make it readable for foreigners.

Please let us discuss this in another Thread if there is a need to do so. In this thread I hope to get answers for a software that can do the romanization. I don't want to write it on my own.

Stephan

Offline

#4 2010-09-01 18:50:49

Melina
Member
Registered: 2010-08-31
Posts: 1

Re: Software for romanization of Thai script

Hi there,

This may be a dumb question, but I'm going to risk asking it anyway:
I downloaded that Thai romanization program on my Windows PC, version 1.4 from Chulalongkorn Univ. However, when I open this Word Document I have with an article written in Thai characters and proceed to cut & paste the Thai text into the Romanization program in order for me to read it ( I can speak Thai fluently but can't read or write) , it shows up as a bunch of question marks, eg "???????".  Do you know if I need to install special system fonts or something? I assumed that if I could open a Word file and view it written in Thai, that my computer recognized the code.... In WORD, the Thai font comes up as "Cordia new" or "Cordia UPC".
Thanks is advance for your help!

Offline

#5 2010-09-09 08:58:59

stephankn
Moderator
Registered: 2010-05-04
Posts: 621

Re: Software for romanization of Thai script

@Melina: I can confirm this behavior. Happens here with Vista, too. I think they are using a wrong codepage (874?). Using the text-file conversion method still works.

This tool is not Unicode enabled. Setting the locale to Thai makes it work as expected.

I used Microsoft AppLocale to set the locale to Thai.

http://www.microsoft.com/downloads/en/d … laylang=en

Works fine with Vista. You have to run the AppLocale installer as administrator.

edit: added applocale

Last edited by stephankn (2010-09-09 09:13:13)

Offline

Board footer

Powered by FluxBB