[smc-discuss] Re: MALAYALAM OCR

Thu Jun 12 04:29:16 PDT 2008

Discussion of Indic language support in Tesseract OCR

---------- Forwarded message ----------
From: Basil Brassily Thoppil <basilbthoppil at gmail.com>
Date: 2008/6/12
Subject: dddd script/indic script support for TESSERACT
To: basilbthoppil at gmail.com

---------- Forwarded message ----------
From: Basil Brassily Thoppil <basilbthoppil at gmail.com>
Date: 2008/6/12
Subject: Fwd: script/indic script support for TESSERACT
To: basilbthoppil at gmail.com

Forwarded conversation
Subject: script/indic script support for TESSERACT
------------------------

From: *Basil B Thoppil* <basilbthoppil at gmail.com>
Date: Sun, Mar 9, 2008 at 8:18 PM
To: tesseract-ocr at googlegroups.com

Hello everyone,

I am planning to give indic language support for Tesseract OCR.

Is it possible to do complex script/indic script with the latest version of
Tesseract.?

regards,
basil
----------
From: *74yrs old* <withblessings at gmail.com>
Date: Sun, Mar 9, 2008 at 10:52 PM
To: tesseract-ocr at googlegroups.com

Yes, it is possible.

----------
From: *Basil B Thoppil* <basilbthoppil at gmail.com>
Date: Sun, Mar 9, 2008 at 11:05 PM
To: tesseract-ocr at googlegroups.com

Hello,
I am a newbie to Tesseract. As far as I know there is the problem of
overlaping characters in Indic languages. How this is handled in
Tesseract(How does the box is formed?).

Please mension more differences or problems we face when work with
Tesseract.

regards
basil
----------
From: *Ray Smith* <theraysmith at gmail.com>
Date: Sat, Apr 19, 2008 at 4:53 AM
To: tesseract-ocr at googlegroups.com

Which Indic languages are you particularly interested in?
Do you have any test data? (scanned images with the corresponding text.)
I have done a lot of work with Kannada, but hit a bit of a stumbling block
with its inherent ambiguity. Some of the other Indic languages might be
easier.
The Kannda work has led to a lot of changes that make it at least possible
to consider the other Indic languages (and Chinese too).
Ray.
----------
From: *Basil Brassily Thoppil* <basilbthoppil at gmail.com>
Date: Sun, Apr 20, 2008 at 10:11 AM
To: tesseract-ocr at googlegroups.com

Hi,
I am planning to do it for Malayalam. I dont know the case with kannada. The
study is going on. There are some scanned images of some old books. Now I am
planning to make some new scanned images.

There are some similar characters there in Malayalam which may cause
ambiguity, but I hope it is solvable  using DangAmbigous  file.

regards,
basil
----------
From: *74yrs old* <withblessings at gmail.com>
Date: Sun, Apr 20, 2008 at 12:23 PM
To: tesseract-ocr at googlegroups.com

Please compare unicode for Malayalam with unicode for Kannada  (unicode.org
).
From the code chart of Malayalam, it  is observed that Malayalam script has
dependent vowels similar to  Kannada script.  If you are familiar with
baraha software(windows platform), you can test Malayalam and Kannada
scripts - by using transliteration from Malayalam to Kannada or Kannada to
Malayalam  This will prove that Malayalam can trained similar to Kannada
easily
 please upload sample text of  few character script (alphabets) of
combination of consonants plus dependent vowels for testing..
----------
From: *onkarswamy* <onkarswamy at gmail.com>
Date: Mon, Apr 21, 2008 at 5:36 PM
To: Basil B Thoppil <basilbthoppil at gmail.com>

hi sir i am also new to tesseract.i am also planing to develop ocr for
kannada language(indic).please tell me how u proceed in this context.

i have developed an on screen keyboard for kannada using BRHKND font

i am currently in mysore ,sjce .
----------
From: *Basil Brassily Thoppil* <basilbthoppil at gmail.com>
Date: Tue, Apr 22, 2008 at 11:49 AM
To: onkarswamy <onkarswamy at gmail.com>

Hi,
This is the scanned image of  Malayalam.
----------
From: *Basil Brassily Thoppil* <basilbthoppil at gmail.com>
Date: Fri, Apr 25, 2008 at 12:46 PM
To: onkarswamy <onkarswamy at gmail.com>

Hi
Hope you saw the document of Malayalam which I have attached.

regards,
basil
----------
From: *Basil Brassily Thoppil* <basilbthoppil at gmail.com>
Date: Fri, Apr 25, 2008 at 12:46 PM
To: tesseract-ocr at googlegroups.com

----------
From: *Basil Brassily Thoppil* <basilbthoppil at gmail.com>
Date: Fri, Apr 25, 2008 at 12:49 PM
To: tesseract-ocr at googlegroups.com

Hi,
Very sorry for the delay. It was wrongly sent to Mr Onkarswamy.
Please see the document and reply.

regards
basil
----------
From: *onkarswamy angadi* <onkarswamy at gmail.com>
Date: Fri, Apr 25, 2008 at 3:16 PM
To: Basil Brassily Thoppil <basilbthoppil at gmail.com>

ya sir ,
what is my next step regarding the image?can u give some literature(or
links) to read and get into next step.
i am in mysore.where r u residing?
----------
From: *74yrs old* <withblessings at gmail.com>
Date: Mon, Apr 28, 2008 at 12:11 PM
To: tesseract-ocr at googlegroups.com

Hi Basil,
I am using MSwindows. Problem is I could not  locte Malayalam fonts  nor in
Character Map in XP, as such I could not perform test nor send you a sample
datafiles of Malayalam.  I shall try again.
----------
From: *74yrs old* <withblessings at gmail.com>
Date: 2008/4/28
To: tesseract-ocr at googlegroups.com, Basil Brassily Thoppil <
basilbthoppil at gmail.com>

Hi Basil,
I have suceeded 4 lang. datafiles for tesseract-ocr as experiment. - vide
attached files (1) 4lang.bmp(2)4lan.box(3)output text viz. 4lang.txt.
please compare output text viz.4lang.txt with 4lang.bmp file and 80% appears
to be correct for malayam whereas 100% correct for Hindi and Kannada-80%.
Since I don't know Malayalam script, please check and indicate number of
mistakes occured.
----------
From: *74yrs old* <withblessings at gmail.com>
Date: Mon, Apr 28, 2008 at 9:30 PM
To: tesseract-ocr at googlegroups.com

It is proved that indic langs can be trained in tesseract-ocr easily. Ray
still working on to improve  tesseract-ocr for Kannada. Once completed, I
trust that all other Indian langs. can  easily be trained in tesseract-ocr
in the next version of tesseract-ocr. At present I have tested
tesseract-ocr  in MSwindows platform.
----------
From: *dikar* <zhuhengguo at gmail.com>
Date: Thu, May 1, 2008 at 12:15 PM
To: tesseract-ocr <tesseract-ocr at googlegroups.com>

hi 74yrs old,
you do a wonderful work!
how do you finished that?
I always get a error like "
 read_variables_file:variable not found: textord_no_rejectsTesseract
Open Source OCR Engine
 Image has 24 bits per pixel and size (110,41)
 Resolution=96
 APPLY_BOXES: FATALITY - 0 labelled samples of "1" - target is 1
 APPLY_BOXES: FATALITY - 0 labelled samples of "0" - target is 2
 APPLY_BOXES:
  Boxes read from boxfile:       3
  Initially labelled blobs:      0 in 0 rows
  Box failures detected:                    3
  Duped blobs for rebalance:     0
  "1" has fewest samples:     0
                               Total unlabelled words:        0
                               Final labelled words:          0
 Generating training data
 Generated training data for 0 blobs
"
I'm crazy almost.
>  4lang.bmp
> 1899KDownload
>
>  4lang.box
> 1KDownload
>
> [4lang.txt]ka kA ki kI ku kU kIkU kAki ke kE kaki ko kEà¤•à¥‡ kaà²•à³‚
kakA kaki
> à²• à²•à²¾ à²•à²¿ à²•à³€ à²•à³  à²•à³‚ à²•à³ƒ à²•à³„ à²•à³† à²•à³‡ à²•à³ˆ
à²•à³Š à²•à³Šà²•à²¾ à²•à³Œ à²•à²‚ à²•à²ƒ
> à¤•à¤‚ à¤•à¤¾ à¤•à¤¿ à¤•à¥€ à¤•à¥  à¤•à¥‚ à¤•à¥ƒ à¤•à¥„ à¤•à¥‡ à¤•à¥†
à¤•à¥ˆ à¤•à¥‹ à¤•à¥Š à¤•à¥Œ à¤•à¤‚ à¤•à¤ƒ
> à´• à´•à´¾ à´•à´¿ à´•àµ€ à´•àµ  à´•àµ‚ à´•àµƒ à´•à´•àµ  à²•à²¾à´• à´•àµ‡
à´•àµ†à´• à²•à²¾à´•à´¾ à²•à²¾à´•à´¾ à²•à²¾à´•à²•à²¾ à´•à´‚ à´•à´ƒ
>
>  4lang.tr
> 182KDownload
----------
From: *74yrs old* <withblessings at gmail.com>
Date: Thu, May 1, 2008 at 1:27 PM
To: tesseract-ocr at googlegroups.com

Dikar,
Which language you are trying?will you forward image with text for perusal.
----------
From: *74yrs old* <withblessings at gmail.com>
Date: Thu, May 1, 2008 at 3:34 PM
To: tesseract-ocr at googlegroups.com

Dikar,
Please forward your trained material like  image, box, text, tesseract.log
files etc- which you find  error. I shall try to locate where mistake
happens and feedback to you.
It may also indicate which operating system( like ubuntu or MSwindows etc)
in which you tried;
 --

----------
From: *Zhu Hengguo* <zhuhengguo at gmail.com>
Date: Thu, May 1, 2008 at 9:41 PM
To: tesseract-ocr at googlegroups.com

I'm training some digital with handwriting.

when I use " tesseract fontfile.tif junk nobatch box.train"  to make .tr
file.

Program always metioned need
•    tessdata/eng.freq-dawg
•    tessdata/eng.word-dawg
•    tessdata/eng.user-words
•    tessdata/eng.inttemp
•    tessdata/eng.normproto
•    tessdata/eng.pffmtable
•    tessdata/eng.unicharset
•    tessdata/eng.DangAmbigs

these files.
But before tr files have finished , inttemp ,pffmtable,normproto files have
not prepared yet.
When copy some examples and rename it as eng.inttemp  , eng.normproto  ,
eng.pffmtable; it report FATALITY occurs.
If use cnTraining.exe to make a null file in advance,it report error also.
So ,I'm failed to create .tr files

I just want to have a try on it ,so the box file and tif file is very simple
.
like that "1","0"
----------
From: *Basil Brassily Thoppil* <basilbthoppil at gmail.com>
Date: Fri, May 2, 2008 at 12:08 AM
To: tesseract-ocr at googlegroups.com

Hi 74 years old,

Very happy to see this much enthusiasm from your side for Indic languages.
Thank you, thank you, very much.
I also experimented with Malayalam and succeded, and I did in Gnu/Linux
platform.
I will attach the files in the next mail.

Thank you very much...
Basil
----------
From: *Basil Brassily Thoppil* <basilbthoppil at gmail.com>
Date: Fri, May 2, 2008 at 12:13 AM
To: 74yrs old <withblessings at gmail.com>

Hi 74yrs old,

Will you please send me some training images for Hindi, that you used for
training.!
I have some doubts.
regards
basil
----------
From: *74yrs old* <withblessings at gmail.com>
Date: Fri, May 2, 2008 at 1:37 PM
To: tesseract-ocr at googlegroups.com

Hi Basil,
Glad to note you have succeeded Malayalam - congratulations. In fact I don't
know Malayalam. In MSwindow - by using baraha software, I translated kannada
to Malayalam and other languages and trained to prove that indic can be
trained in tesseract-ocr easily.

In fact  some of relevant source codes in tesseract-ocr have been modified
by Ray, taking consideration of complexity  in Kannada Script  like
combination  of consonants  plus  dependent vowels,  number of characters
etc. I cannot explain since not I am not programmer nor developer.

All credit should go to Mr.Ray, owner of Tesseract-ocr, who has taken very
much interest in developing Kannada project - which is most complex script
among the Indic including other world languages  which has combination of
consonants plus dependent vowels etc. Still research is going on to perfect
the relevant source code by Ray. so that all world languages can be trained
easily in tesseract-ocr to the extent possible.

You can try out in MSplatform also where baraha software (www.baraha.com) is
available to train any other indic languages (using transliteration) easily.
With best Wishes.
----------
From: *74yrs old* <withblessings at gmail.com>
Date: Fri, May 2, 2008 at 2:33 PM
To: Basil Brassily Thoppil <basilbthoppil at gmail.com>

Hi,
Please see attached file "4lang.bmp" forwarded to you on April, 28 - in
which contains images of Hindi, Malayalam, Kannada etc   What is your doubt?
-sriranga(75yrsold)
----------
From: *74yrs old* <withblessings at gmail.com>
Date: Fri, May 2, 2008 at 4:24 PM
To: tesseract-ocr at googlegroups.com

Dikar,
Please find attached files. Output( hand.txt) agree with the smaple
handwriting image(a.bmp). suceeded in handwriting test.

1000

----------
From: *Zhu Hengguo* <zhuhengguo at gmail.com>
Date: Fri, May 2, 2008 at 5:13 PM
To: tesseract-ocr at googlegroups.com

Wow,magic! thanks a lot!

2008/5/2 74yrs old <withblessings at gmail.com>:

> 1000
>
>
> ----------
From: *Zhu Hengguo* <zhuhengguo at gmail.com>
Date: Fri, May 2, 2008 at 7:48 PM
To: tesseract-ocr at googlegroups.com

hi 74yrs old,
What I meet is strange, but it works.
The reason is :
1 when   you can do three things ,it maybe can work
        1.1 change .tif file to .bmp file (why, I don't know)
        1.2 if tif file support 24bit color ,change it to 1bit color.
        1.3 add something in command like that:
                   tesseract fontfile.tif junk -l aba nobatch box.train
              aba is a language which you want to training.
              Note,if it metioned "need unicharset,...",you can use other
tools(unicharset_extractor) make it in advance.
              Note,another strange thing is when build 2.03 version,  many
thing work good but wordlist2dawg use a large mount memory and not stop.When
I use old version,it works.

2 when
            tesseract image.tif output -l lang
    the output is empty.when I use image.bmp, it works!

thanks a lot!
And also thanks theraysmith and others, this ocr is a good ocr.
----------
From: *74yrs old* <withblessings at gmail.com>
Date: Fri, May 2, 2008 at 11:37 PM
To: tesseract-ocr at googlegroups.com

Zhu,
*Reg:wordlist_word/freq_drawig in tesseract2-03*:  Yes,there is  problem as
you reported - I have already brought to the kind notice of Ray. Wait for
the patch. In fact, I have tested in tesseract 2.01 only.

Reg:1.1: tif was scanned at 96 dpi -as such blank. If you had scanned at 300
dpi, it would have work.  With help of Paint brush  I saved "a.tif" as
"a.bmp file".

Reg. 1.2 Here not question of 24 colors,but question of dpi - morever tif
should be uncompressed plus scanned at least 300dpi or more.

Reg 1.3  I have not added anything in the commandline. I followed wiki
instructions for training. All generated datafiles should be prefixed  three
characters like aba or han etc.- for example generated datafile as
"unicharset" - for which you have to prefix like "aba.unicharset" or
han.unicharset etc.

Reg:2 Yes because tif was scanned at 96 dpi and
not 300- dpi. as such output was empty.in your case.

Please note that for handwriting - it requires more training for single
character due to variation of shape,size etc while writing. . I feel
steady/uniform handwritings  may succeed
----------
From: *Basil Brassily Thoppil* <basilbthoppil at gmail.com>
Date: Sat, May 3, 2008 at 12:38 PM
To: tesseract-ocr at googlegroups.com

Hi 78yrs old,
I havent done the work completely. It was just for experimentation purpose.
I trained some single characters only. But there are a lot of combinations
of letters and many rendering problems in Malayalam. We have to consider
that also.

for eg.

ഗ +  ്  + ന   becomes  ഗ്ന I am just coming to that part. Given for your
research purpose.

regards
basil.
----------
From: *Basil Brassily Thoppil* <basilbthoppil at gmail.com>
Date: Sat, May 3, 2008 at 12:47 PM
To: tesseract-ocr at googlegroups.com

Hi,
about Hindi,
please see the attached image of a Hindi document.
There is a line connecting the letters of each word. Then how does tesseract
idenntify a single letter from a document? Is there any special feature .?
regards
basil
----------
From: *74yrs old* <withblessings at gmail.com>
Date: 2008/5/3
To: tesseract-ocr at googlegroups.com

Basil,
As an example please attached file - which is self explanatory about each
character has combinations of consonants plus vowel. and such each character
has to be trained -.read with wiki about training -.and generate  Malayalam
datafiles (similar to English datafiles) etc.

I can understand your example - which  have already been considered  by Ray,
for the latest version of tesseract-2.03.  As such there may not any problem
with Malayam  which is similar to Kannada.

Trust you have hands-on experience  with the English (phototest.tif) and
also  with english words or text you might have experimented.  .

അ ആ ഇ ഈ ഉ ഊ ഋ ൠ എ ഏ ഐ ഒ ഓ ഔ അം അഃ
ക കാ കി കീ കു കൂ കൃ കൠ കെ കേ കൈ കൊ കോ കൌ കം കഃ
ഗ ഗാ ഗി ഗീ ഗു ഗൂ ഗൃ ഗൠ ഗെ ഗേ ഗൈ ഗൊ ഗോ ഗൌ ഗം ഗഃ
ഖ ഖാ ഖി ഖീ ഖു ഖൂ ഖൃ ഖൠ ഖെ ഖേ ഖൈ ഖൊ ഖോ ഖൌ ഖം ഖഃ
ഘ ഘാ ഘി ഘീ ഘു ഘൂ ഘൃ ഘൠ ഘെ ഘേ ഘൈ ഘൊ ഘോ ഘൌ ഘം ഘഃ
ങ ങാ ങി ങീ ങു ങൂ ങൃ ങൠ ങെ ങേ ങൈ ങൊ ങോ ങൌ ങം ങഃ
ച ചാ ചി ചീ ചു ചൂ ചൃ ചൠ ചെ ചേ ചൈ ചൊ ചോ ചൌ ചം ചഃ
ഛ ഛാ ഛി ഛീ ഛു ഛൂ ഛൃ ഛൠ ഛെ ഛേ ഛൈ ഛൊ ഛോ ഛൌ ഛം ഛഃ
ജ ജാ ജി ജീ ജു ജൂ ജൃ ജൠ ജെ ജേ ജൈ ജൊ ജോ ജൌ ജം ജഃ
ഝ ഝാ ഝി ഝീ ഝു ഝൂ ഝൃ ഝൠ ഝെ ഝേ ഝൈ ഝൊ ഝോ ഝൌ ഝം  ഝഃ
൦ ൧ ൨൩ ൪൫ ൬ ൭ ൮ ൯ : ; < = > ? ॒ [ \ ] ‍ _ { | }
~| || ! " # % & ' (  ) * + , - . /
ಞ ಞಾ ಞಿ ಞೀ ಞು ಞೂ ಞೃ ಞೄ ಞೆ ಞೇ ಞೈ ಞೊ ಞೋ ಞೌ ಞಂ ಞಃ
ಟ ಟಾ ಟಿ ಟೀ ಟು ಟೂ ಟೃ ಟೄ ಟಿ ಟೀ ಟೈ ಟೊ ಟೋ ಟೌ ಟಂ ಟಃ
ಠ ಠಾ ಠಿ ಠೀ ಠು ಠೂ ಠೃ ಠೄ ಠೆ ಠೇ ಠೈ ಠೊ ಠೋ ಠೌ ಠಂ ಠಃ
ಡ ಡಾ ಡಿ ಡೀ ಡು ಡೂ ಡೃ ಡೄ ಡೆ ಡೇ ಡೈ ಡೊ ಡೋ ಡೌ ಡಂ ಡಃ
ಢ ಢಾ ಢಿ ಢೀ ಢು ಢೂ ಢೃ ಢೄ ಢೆ ಢೇ ಢೈ ಢೊ ಢೋ ಢೌ ಢಂ ಢಃ
ಣ ಣಾ ಣಿ ಣೀ ಣು ಣೂ ಣೃ ಣೄ ಣೆ ಣೇ ಣೈ ಣೊ ಣೋ ಣೌ ಣಂ ಣಃ
ತ ತಾ ತಿ ತೀ ತು ತೂ ತೃ ತೄ ತೆ ತೇ ತೈ ತೊ ತೋ ತೌ ತಂ ತಃ
ಥ ಥಾ ಥಿ ಥೀ ಥು ಥೂ ಥೃ ಥೄ ಥೆ ಥೇ ಥೈ ಥೊ ಥೋ ಥೌ ಥಂ ಥಃ
ದ ದಾ ದಿ ದೀ ದು ದೂ ದೃ ದೄ ದೆ ದೇ ದೈ ದೊ ದೋ ದೌ ದಂ ದಃ
ಧ ಧಾ ಧಿ ಧೀ ಧು ಧೂ ಧೃ ಧೄ ಧೆ ಧೇ ಧೈ ಧೊ ಧೋ ಧೌ ಧಂ ಧಃ
ನ ನಾ ನಿ ನೀ ನು ನೂ ನೃ ನೄ ನೆ ನೇ ನೈ ನೊ ನೋ ನೌ ನಂ ನಃ
ಪ ಪಾ ಪಿ ಪೀ ಪು ಪೂ ಪೃ ಪೄ ಪೆ ಪೇ ಪೈ ಪೊ ಪೋ ಪೌ ಪಂ ಪಃ
ಫ ಫಾ ಫಿ ಫೀ ಫು ಫೂ ಫೃ ಫೄ ಫೆ ಫೇ ಫೈ ಫೊ ಫೋ ಫೌ ಫಂ ಫಃ
ಬ ಬಾ ಬಿ ಬೀ ಬು ಬೂ ಬೃ ಬೄ ಬೆ ಬೇ ಬೈ ಬೊ ಬೋ ಬೌ ಬಂ ಬಃ
ಭ ಭಾ ಭಿ ಭೀ ಭು ಭೂ ಭೃ ಭೄ ಭೆ ಭೇ ಭೈ ಭೊ ಭೋ ಭೌ ಭಂ ಭಃ
ಮ ಮಾ ಮಿ ಮೀ ಮು ಮೂ ಮೃ ಮೄ ಮೆ ಮೇ ಮೈ ಮೊ ಮೋ ಮೌ ಮಂ ಮಃ
ಯ ಯಾ ಯಿ ಯೀ ಯು ಯೂ ಯೃ ಯೄ ಯೆ ಯೇ ಯೈ ಯೊ ಯೋ ಯೌ ಯಂ ಯಃ
ರ ರಾ ರಿ ರೀ ರು ರೂ ರೃ ರೄ ರೆ ರೇ ರೈ ರೊ ರೋ ರೌ ರಂ ರಃ
ಲ ಲಾ ಲಿ ಲೀ ಲು ಲೂ ಲೃ ಲೄ ಲೆ ಲೇ ಲೈ ಲೊ ಲೋ ಲೌ ಲಂ ಲಃ
ಳ ಳಾ ಳಿ ಳೀ ಳು ಳೂ ಳೃ ಳೄ ಳೆ ಳೇ ಳೈ ಳೊ ಳೋ ಳೌ ಳಂ ಳಃ
ವ ವಾ ವಿ ವೀ ವು ವೂ ವೃ ವೄ ವೆ ವೇ ವೈ ವೊ ವೋ ವೌ ವಂ ವಃ
ಶ ಶಾ ಶಿ ಶೀ ಶು ಶೂ ಶೃ ಶೄ ಶೆ ಶೇ ಶೈ ಶೊ ಶೋ ಶೌ ಶಂ ಶಃ
ಷ ಷಾ ಷಿ ಷೀ ಷು ಷೂ ಷೃ ಷೄ ಷೆ ಷೇ ಷೈ ಷೊ ಷೋ ಷೌ ಷಂ ಷಃ
ಸ ಸಾ ಸಿ ಸೀ ಸು ಸೂ ಸೃ ಸೄ ಸೆ ಸೇ ಸೈ ಸೊ ಸೋ ಸೌ ಸಂ ಸಃ
ಹ ಹಾ ಹಿ ಹೀ ಹು ಹೂ ಹೃ ಹೄ ಹೆ ಹೇ ಹೈ ಹೊ ಹೋ ಹೌ ಹಂ ಹಃ
ಕ್ಕ ಖ್ಖ ಗ್ಗ ಘ್ಘ ಙ್ಙ ಚ್ಚ ಛ್ಛ ಜ್ಜ ಝ್ಝ ಞ್ಞ ಟ್ಟ ಠ್ಠ ಡ್ಡ ಢ್ಢ ಣ್ಣ ತ್ತ ಥ್ಥ ದ್ದ ಧ್ಧ
ನ್ನ ಪ್ಪ ಫ್ಫ ಬ್ಬ ಭ್ಭ ಮ್ಮ
ಯ್ಯ ರ್ರ ಲ್ಲ ಳ್ಳ ವ್ವ ಶ್ಶ ಷ್ಷ ಸ್ಸ ಹ್ಹ ಕ್ರ ಖ್ರ ಗ್ರ ಘ್ರ ಙ್ರ ಚ್ರ ಛ್ರ ಜ್ರ ಝ್ರ ಞ್ರ
ಠ್ರ ಡ್ರ ಢ್ರ ಣ್ರ
ತ್ರ ಥ್ರ ದ್ರ ಧ್ರ ನ್ರ ಪ್ರ ಫ್ರ ಬ್ರ ಭ್ರ ಮ್ರ ಯ್ರ ರ್ರ ಲ್ರ ಳ್ರ ವ್ರ ಶ್ರ ಷ್ರ ಸ್ರ ಹ್ರ
೦ ೧ ೨ ೩ ೪ ೫ ೬ ೭ ೮ ೯
ಶ್ಟು ಷ್ಟು ಕ್ಟ್ ಗ್ಧ ದ್ಧ ದ್ಧೆ ಸ್ದೇ ದ್ಭು ಶ್ರೀ ಸ್ತ್ರೀ ಪ್ರ ಸ್ರ ಣ್ಯ ಧ್ಯ ವ್ಯಾ ಪ್ಯಾ
ಲ್ಯ ತ್ಯ ತ್ತ್ಯನ್ಯೋ ಸ್ಯಾನ್ಯಃ ರ್ಣಾ ರ್ತೃ ರ್ಬ ತ್ತ್ವ ಶ್ವೇ ಶ್ವ ಸ್ವ ದ್ವಾ ದ್ವಾ ಸ್ವಾ
ದ್ವ ಜ್ಞಾ ಜ್ಞ ಬ್ರ ಹ್ಮಾಂ ನ್ಮ ಹ್ಮ ಲ್ಮೀ ಸ್ಪಶ್ನ ಸ್ತ ಯ್ತು ಪ್ತ ಷ್ಟ್ರೀ ಸ್ಟೆ ಫ್ಲಾ
ಕ್ಟ್ರಿ ಷ್ಠೆ ಶ್ಚಾ ತ್ಯ ಕ್ಪ್ಮ ಜ್ಞ ಷ್ಟ್ರ ಕ್ಷ್ಮಿ ಸ್ಥಿ ಷ್ಟ್ರಾ ವೈ ದ್ಯ್ಕ ನ್ಕೈ
ಕ್ಷಂ ಕ್ಷ ಕ್ಲಾ ಥ್ಯಾಂ ಕ್ಸ್. ಹ್ಯಾಂ ಲ್ಮೇ ಲ್ಡ

ಅ ಆ ಇ ಈ ಉ ಊ ಋ ೠ ಎ ಏ ಐ ಒ ಓ ಔ ಅಂ ಅಃ
ಕ ಕಾ ಕಿ ಕೀ ಕು ಕೂ ಕೃ ಕೄ ಕೆ ಕೇ ಕೈ ಕೊ ಕೋ ಕೌ ಕಂ ಕಃ
ಗ ಗಾ ಗಿ ಗೀ ಗು ಗೂ ಗೃ ಗೄ ಗೆ ಗೇ ಗೈ ಗೊ ಗೋ ಗೌ ಗಂ ಗಃ
ಖ ಖಾ ಖಿ ಖೀ ಖು ಖೂ ಖೃ ಖೄ ಖೆ ಖೇ ಖೈ ಖೊ ಖೋ ಖೌ ಖಂ ಖಃ
ಘ ಘಾ ಘಿ ಘೀ ಘು ಘೂ ಘೃ ಘೄ ಘೆ ಘೇ ಘೈ ಘೊ ಘೋ ಘೌ ಘಂ ಘಃ
ಙ ಙಾ ಙಿ ಙೀ ಙು ಙೂ ಙೃ ಙೄ ಙೆ ಙೇ ಙೈ ಙೊ ಙೋ ಙೌ ಙಂ ಙಃ
ಚ ಚಾ ಚಿ ಚೀ ಚು ಚೂ ಚೃ ಚೄ ಚೆ ಚೇ ಚೈ ಚೊ ಚೋ ಚೌ ಚಂ ಚಃ
ಛ ಛಾ ಛಿ ಛೀ ಛು ಛೂ ಛೃ ಛೄ ಛೆ ಛೇ ಛೈ ಛೊ ಛೋ ಛೌ ಛಂ ಛಃ
ಜ ಜಾ ಜಿ ಜೀ ಜು ಜೂ ಜೃ ಜೄ ಜೆ ಜೇ ಜೈ ಜೊ ಜೋ ಜೌ ಜಂ ಜಃ
ಝ ಝಾ ಝಿ ಝೀ ಝು ಝೂ ಝೃ ಝೄ ಝೆ ಝೇ ಝೈ ಝೊ ಝೋ ಝೌ ಝಂ  ಝಃ
0 1 2 3 4 5 6 7 8 9 : ; < = > ? @ [ \ ] ^ _ { | }
~÷। ॥ ! " # % & ' (  ) * + , - . /
ಞ ಞಾ ಞಿ ಞೀ ಞು ಞೂ ಞೃ ಞೄ ಞೆ ಞೇ ಞೈ ಞೊ ಞೋ ಞೌ ಞಂ ಞಃ
ಟ ಟಾ ಟಿ ಟೀ ಟು ಟೂ ಟೃ ಟೄ ಟಿ ಟೀ ಟೈ ಟೊ ಟೋ ಟೌ ಟಂ ಟಃ
ಠ ಠಾ ಠಿ ಠೀ ಠು ಠೂ ಠೃ ಠೄ ಠೆ ಠೇ ಠೈ ಠೊ ಠೋ ಠೌ ಠಂ ಠಃ
ಡ ಡಾ ಡಿ ಡೀ ಡು ಡೂ ಡೃ ಡೄ ಡೆ ಡೇ ಡೈ ಡೊ ಡೋ ಡೌ ಡಂ ಡಃ
ಢ ಢಾ ಢಿ ಢೀ ಢು ಢೂ ಢೃ ಢೄ ಢೆ ಢೇ ಢೈ ಢೊ ಢೋ ಢೌ ಢಂ ಢಃ
ಣ ಣಾ ಣಿ ಣೀ ಣು ಣೂ ಣೃ ಣೄ ಣೆ ಣೇ ಣೈ ಣೊ ಣೋ ಣೌ ಣಂ ಣಃ
ತ ತಾ ತಿ ತೀ ತು ತೂ ತೃ ತೄ ತೆ ತೇ ತೈ ತೊ ತೋ ತೌ ತಂ ತಃ
ಥ ಥಾ ಥಿ ಥೀ ಥು ಥೂ ಥೃ ಥೄ ಥೆ ಥೇ ಥೈ ಥೊ ಥೋ ಥೌ ಥಂ ಥಃ
ದ ದಾ ದಿ ದೀ ದು ದೂ ದೃ ದೄ ದೆ ದೇ ದೈ ದೊ ದೋ ದೌ ದಂ ದಃ
ಧ ಧಾ ಧಿ ಧೀ ಧು ಧೂ ಧೃ ಧೄ ಧೆ ಧೇ ಧೈ ಧೊ ಧೋ ಧೌ ಧಂ ಧಃ
ನ ನಾ ನಿ ನೀ ನು ನೂ ನೃ ನೄ ನೆ ನೇ ನೈ ನೊ ನೋ ನೌ ನಂ ನಃ
ಪ ಪಾ ಪಿ ಪೀ ಪು ಪೂ ಪೃ ಪೄ ಪೆ ಪೇ ಪೈ ಪೊ ಪೋ ಪೌ ಪಂ ಪಃ
ಫ ಫಾ ಫಿ ಫೀ ಫು ಫೂ ಫೃ ಫೄ ಫೆ ಫೇ ಫೈ ಫೊ ಫೋ ಫೌ ಫಂ ಫಃ
ಬ ಬಾ ಬಿ ಬೀ ಬು ಬೂ ಬೃ ಬೄ ಬೆ ಬೇ ಬೈ ಬೊ ಬೋ ಬೌ ಬಂ ಬಃ
ಭ ಭಾ ಭಿ ಭೀ ಭು ಭೂ ಭೃ ಭೄ ಭೆ ಭೇ ಭೈ ಭೊ ಭೋ ಭೌ ಭಂ ಭಃ
ಮ ಮಾ ಮಿ ಮೀ ಮು ಮೂ ಮೃ ಮೄ ಮೆ ಮೇ ಮೈ ಮೊ ಮೋ ಮೌ ಮಂ ಮಃ
ಯ ಯಾ ಯಿ ಯೀ ಯು ಯೂ ಯೃ ಯೄ ಯೆ ಯೇ ಯೈ ಯೊ ಯೋ ಯೌ ಯಂ ಯಃ
ರ ರಾ ರಿ ರೀ ರು ರೂ ರೃ ರೄ ರೆ ರೇ ರೈ ರೊ ರೋ ರೌ ರಂ ರಃ
ಲ ಲಾ ಲಿ ಲೀ ಲು ಲೂ ಲೃ ಲೄ ಲೆ ಲೇ ಲೈ ಲೊ ಲೋ ಲೌ ಲಂ ಲಃ
ಳ ಳಾ ಳಿ ಳೀ ಳು ಳೂ ಳೃ ಳೄ ಳೆ ಳೇ ಳೈ ಳೊ ಳೋ ಳೌ ಳಂ ಳಃ
ವ ವಾ ವಿ ವೀ ವು ವೂ ವೃ ವೄ ವೆ ವೇ ವೈ ವೊ ವೋ ವೌ ವಂ ವಃ
ಶ ಶಾ ಶಿ ಶೀ ಶು ಶೂ ಶೃ ಶೄ ಶೆ ಶೇ ಶೈ ಶೊ ಶೋ ಶೌ ಶಂ ಶಃ
ಷ ಷಾ ಷಿ ಷೀ ಷು ಷೂ ಷೃ ಷೄ ಷೆ ಷೇ ಷೈ ಷೊ ಷೋ ಷೌ ಷಂ ಷಃ
ಸ ಸಾ ಸಿ ಸೀ ಸು ಸೂ ಸೃ ಸೄ ಸೆ ಸೇ ಸೈ ಸೊ ಸೋ ಸೌ ಸಂ ಸಃ
ಹ ಹಾ ಹಿ ಹೀ ಹು ಹೂ ಹೃ ಹೄ ಹೆ ಹೇ ಹೈ ಹೊ ಹೋ ಹೌ ಹಂ ಹಃ
ಕ್ಕ ಖ್ಖ ಗ್ಗ ಘ್ಘ ಙ್ಙ ಚ್ಚ ಛ್ಛ ಜ್ಜ ಝ್ಝ ಞ್ಞ ಟ್ಟ ಠ್ಠ ಡ್ಡ ಢ್ಢ ಣ್ಣ ತ್ತ ಥ್ಥ ದ್ದ ಧ್ಧ
ನ್ನ ಪ್ಪ ಫ್ಫ ಬ್ಬ ಭ್ಭ ಮ್ಮ
ಯ್ಯ ರ್ರ ಲ್ಲ ಳ್ಳ ವ್ವ ಶ್ಶ ಷ್ಷ ಸ್ಸ ಹ್ಹ ಕ್ರ ಖ್ರ ಗ್ರ ಘ್ರ ಙ್ರ ಚ್ರ ಛ್ರ ಜ್ರ ಝ್ರ ಞ್ರ
ಠ್ರ ಡ್ರ ಢ್ರ ಣ್ರ
ತ್ರ ಥ್ರ ದ್ರ ಧ್ರ ನ್ರ ಪ್ರ ಫ್ರ ಬ್ರ ಭ್ರ ಮ್ರ ಯ್ರ ರ್ರ ಲ್ರ ಳ್ರ ವ್ರ ಶ್ರ ಷ್ರ ಸ್ರ ಹ್ರ
೦ ೧ ೨ ೩ ೪ ೫ ೬ ೭ ೮ ೯
ಶ್ಟು ಷ್ಟು ಕ್ಟ್ ಗ್ಧ ದ್ಧ ದ್ಧೆ ಸ್ದೇ ದ್ಭು ಶ್ರೀ ಸ್ತ್ರೀ ಪ್ರ ಸ್ರ ಣ್ಯ ಧ್ಯ ವ್ಯಾ ಪ್ಯಾ
ಲ್ಯ ತ್ಯ ತ್ತ್ಯನ್ಯೋ ಸ್ಯಾನ್ಯಃ ರ್ಣಾ ರ್ತೃ ರ್ಬ ತ್ತ್ವ ಶ್ವೇ ಶ್ವ ಸ್ವ ದ್ವಾ ದ್ವಾ ಸ್ವಾ
ದ್ವ ಜ್ಞಾ ಜ್ಞ ಬ್ರ ಹ್ಮಾಂ ನ್ಮ ಹ್ಮ ಲ್ಮೀ ಸ್ಪಶ್ನ ಸ್ತ ಯ್ತು ಪ್ತ ಷ್ಟ್ರೀ ಸ್ಟೆ ಫ್ಲಾ
ಕ್ಟ್ರಿ ಷ್ಠೆ ಶ್ಚಾ ತ್ಯ ಕ್ಪ್ಮ ಜ್ಞ ಷ್ಟ್ರ ಕ್ಷ್ಮಿ ಸ್ಥಿ ಷ್ಟ್ರಾ ವೈ ದ್ಯ್ಕ ನ್ಕೈ
ಕ್ಷಂ ಕ್ಷ ಕ್ಲಾ ಥ್ಯಾಂ ಕ್ಸ್. ಹ್ಯಾಂ ಲ್ಮೇ ಲ್ಡ

----------
From: *74yrs old* <withblessings at gmail.com>
Date: Thu, May 8, 2008 at 3:40 PM
To: Basil Brassily Thoppil <basilbthoppil at gmail.com>

Basil,
I like to have your feedback of your experience
Best Wishes.

2008/5/3 74yrs old <withblessings at gmail.com>:

--~--~---------~--~----~------------~-------~--~----~
സ്വതന്ത്ര മലയാളം കമ്പ്യൂട്ടിങ്ങ് - എന്റെ കമ്പ്യൂട്ടറിന് എന്റെ ഭാഷ
പിരിഞ്ഞു പോകാന്‍: smc-discuss-unsubscribe at googlegroups.com
സംരംഭം: https://savannah.nongnu.org/projects/smc
-~----------~----~----~----~------~----~------~--~---
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.smc.org.in/pipermail/discuss-smc.org.in/attachments/20080612/70ddd5e0/attachment-0001.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Scan-mayyazhi.jpg
Type: image/jpeg
Size: 352705 bytes
Desc: not available
URL: <http://lists.smc.org.in/pipermail/discuss-smc.org.in/attachments/20080612/70ddd5e0/Scan-mayyazhi.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 4lang.bmp
Type: image/bmp
Size: 1440054 bytes
Desc: not available
URL: <http://lists.smc.org.in/pipermail/discuss-smc.org.in/attachments/20080612/70ddd5e0/4lang.bmp>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: a.tif
Type: image/tiff
Size: 18314 bytes
Desc: not available
URL: <http://lists.smc.org.in/pipermail/discuss-smc.org.in/attachments/20080612/70ddd5e0/a.tif>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: a.bmp
Type: image/bmp
Size: 718 bytes
Desc: not available
URL: <http://lists.smc.org.in/pipermail/discuss-smc.org.in/attachments/20080612/70ddd5e0/a.bmp>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 4lang.box
Type: application/octet-stream
Size: 1480 bytes
Desc: not available
URL: <http://lists.smc.org.in/pipermail/discuss-smc.org.in/attachments/20080612/70ddd5e0/4lang.box>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: 4lang.txt
URL: <http://lists.smc.org.in/pipermail/discuss-smc.org.in/attachments/20080612/70ddd5e0/4lang.txt>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 4lang.tr
Type: application/x-troff
Size: 138452 bytes
Desc: not available
URL: <http://lists.smc.org.in/pipermail/discuss-smc.org.in/attachments/20080612/70ddd5e0/4lang.tr>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: a.box
Type: application/octet-stream
Size: 30 bytes
Desc: not available
URL: <http://lists.smc.org.in/pipermail/discuss-smc.org.in/attachments/20080612/70ddd5e0/a.box>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: a.box
Type: application/octet-stream
Size: 61 bytes
Desc: not available
URL: <http://lists.smc.org.in/pipermail/discuss-smc.org.in/attachments/20080612/70ddd5e0/a-0001.box>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: a.tr
Type: application/x-troff
Size: 4746 bytes
Desc: not available
URL: <http://lists.smc.org.in/pipermail/discuss-smc.org.in/attachments/20080612/70ddd5e0/a.tr>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: hand.txt
URL: <http://lists.smc.org.in/pipermail/discuss-smc.org.in/attachments/20080612/70ddd5e0/hand.txt>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: malayam+kan-Unicode.txt
URL: <http://lists.smc.org.in/pipermail/discuss-smc.org.in/attachments/20080612/70ddd5e0/malayamkan-Unicode.txt>