The above video goes away if you are a member and logged in, so log in now!
@oksky022: ...no problem, actually, but first I would like to have "the list" I've described in the post #129. Without it I will not do anything ...
- Chinese Traditional (Big 5)
- 13051 distinct character
I just know it is code page 950 but i don't know how to construct the code page of total 13051 character ,may be magusneo can help . However ,I will still try to search on the net ,hope i can help .
@hip203:...actually, I understand nothing in the bitmaps pointed by these links. Where the hell thousands of symbols there? ...
unicode chinese character that is identified on the bitmap
8140 = U+4E02
This is just one of character ....so for chinese character there is really thousand of character......And we actually need all the character
This is the table chinese
Every sublist there is a table for another huge table of chinese character .......
So all of these need to intrepreted in unicode/and all the charcter drawing need to be intrepreted in bitmap?
@hip203: OK, I see. Thank you. In which form subtitle files are stored in most cases for Eastern languages? Is it unicode or MBCS (like stated at aforementioned tables?) Actually, 2 things should happen:
- font creation by MtkFontCreator utility (here I need the list I've mentioned earlier to make .ini file for this utility in order to produce font bitmaps). Having the information from links you provided is just 50/<number of characters+overhead>% of work , since the list (just a text file with numbers) must be created by someone;
- if subtitle files use MBCS, then another list or table (or algorithm, I'm not sure yet) which maps MBCS codes to unicode has to be created;
Now you can imagine how much time would it cost to make such a lists. Someone must just sit, look at the Microsoft's page, and type numbers very carefully, since it would be quite problematic to find a mistake in such a huge lists. Actual programming becomes easiest part of the whole process, I think . Another issue is memory requirements. Each character takes 512 bytes of memory in uncompressed state. For 8000 characters there will be ~4MB, which is still acceptable, I think, but it depends on movie's resolution. For some hi-res movies there could be a problem...
This dat file I got from a chinese program
It has all the unicode code ....
need some time to do adjustment to the file to create a .ini file
@redtiger: ...that's what I needed in the first place. Thank you. @'hip203': no need to make further adjustments, I'll do that, no problem. @both of you: what's the difference between 'pinyin.zip' and 'MTK_big5.zip'? Which one shall be chosen? About priorities: as you probably know, I'm a professional programmer, so first priority will be given to the job I paid for . According to the poll results SMS development shall go for the new formats (I'm busy with mpeg, but I won't be able to compete it in the near future). So, I can't promise you quick Eastern subs implementation, but I'll definitely pay my attention at it, so, first I'll try to scratch some routines which performs just text drawing, then, we will see, OK? ...