[smc-discuss] GSoC 2013 A spell checker for Indic language that understands inflections

Jahfar Ali jahfar.ali at gmail.com
Fri May 3 22:14:38 PDT 2013


A trie, which is commonly used to store a dictionary, can be combined with
a set of sandhi rules to do the job of a sandhi splitter. I think it is a
better option than regex.
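
To make the idea concrete, here is a minimal sketch in Python. It is not
taken from the proposal or from Hunspell; the class and function names, the
rule format (surface, left_end, right_start) and the ASCII toy words are all
assumptions made for illustration. Real Malayalam rules would be richer.

```python
class TrieNode:
    def __init__(self):
        self.children = {}    # maps one character to the next node
        self.is_word = False  # True if a dictionary word ends at this node


class Trie:
    def __init__(self, words=()):
        self.root = TrieNode()
        for word in words:
            self.insert(word)

    def insert(self, word):
        node = self.root
        for ch in word:
            node = node.children.setdefault(ch, TrieNode())
        node.is_word = True

    def contains(self, word):
        node = self.root
        for ch in word:
            node = node.children.get(ch)
            if node is None:
                return False
        return node.is_word


# Hypothetical rule format: the text `surface` seen at a junction is assumed
# to come from a left word ending in `left_end` and a right word starting
# with `right_start`. The empty rule stands for plain concatenation.
RULES = [
    ("", "", ""),     # no change at the junction
    ("ya", "i", "a"), # toy rule: i + a -> ya
]


def split(word, trie, rules):
    """Return every way to split `word` into dictionary words."""
    results = [[word]] if trie.contains(word) else []
    for i in range(1, len(word)):
        for surface, left_end, right_start in rules:
            if not word.startswith(surface, i):
                continue
            left = word[:i] + left_end
            rest = right_start + word[i + len(surface):]
            # Skip empty remainders, and remainders that did not get
            # shorter, so that the recursion always terminates.
            if not rest or len(rest) >= len(word):
                continue
            if trie.contains(left):
                for tail in split(rest, trie, rules):
                    results.append([left] + tail)
    return results


# Toy dictionary, ASCII only, purely illustrative.
trie = Trie(["iti", "adi", "tree", "top"])
```

With this toy data, split("ityadi", trie, RULES) undoes the junction rule
and split("treetop", trie, RULES) falls back to plain concatenation. A word
that cannot be split into dictionary words gives an empty list, which a
spell checker could treat as a misspelling.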

There is an intrinsic dependency between sandhi splitting and morphological
analysis in Malayalam, as it is an agglutinative language.



On Fri, May 3, 2013 at 2:28 PM, Priya Pappachan <priyapappachan010 at gmail.com
> wrote:

> Yes, Hunspell is a spell checker and morphological analyzer.
>
>
> On Fri, May 3, 2013 at 2:03 PM, മഹേഷ് മുകുന്ദന്‍ | Mahesh M <
> maheshmukundan at gmail.com> wrote:
>
>> I had not read the sandhi part of the proposal. Regex seems sufficient; I
>> guess there is nothing that it won't address.
>>
>> Thanks.
>>
>>
>> On Fri, May 3, 2013 at 12:59 PM, മഹേഷ് മുകുന്ദന്‍ | Mahesh M <
>> maheshmukundan at gmail.com> wrote:
>>
>>> As far as I understand, sandhivichedam is part of spell checking. You
>>> can't do a spell check without doing sandhivichedam, right?
>>>
>>>
>>> On Thu, May 2, 2013 at 9:37 PM, Priya Pappachan <
>>> priyapappachan010 at gmail.com> wrote:
>>>
>>>>
>>>> The project is to achieve spell checking using Hunspell. If that is not
>>>> feasible, a Python-based solution has to be attempted.
>>>>
>>>>
>>>> On Wed, May 1, 2013 at 11:08 PM, മഹേഷ് മുകുന്ദന്‍ | Mahesh M <
>>>> maheshmukundan at gmail.com> wrote:
>>>>
>>>>> I remember seeing an option to put language-specific code in Hunspell.
>>>>> Will that be useful or needed for sandhivichchedam, instead of writing
>>>>> something in Python?
>>>>>
>>>>>
>>>>> On Wed, May 1, 2013 at 6:29 PM, Priya Pappachan <
>>>>> priyapappachan010 at gmail.com> wrote:
>>>>>
>>>>>> Hi,
>>>>>>         I started studying the rules that are to be followed for the
>>>>>> language while writing the affix file. The different sandhi rules have
>>>>>> to be known. I also understood the challenges that can occur while
>>>>>> scripting, due to the lack of a unique system and in formulating sandhi
>>>>>> rules in the language.
>>>>>>
>>>>>>         I request you to go through my updated proposal.
>>>>>>         http://wiki.smc.org.in/User:Priyapappachan/GSoC-spellchecker
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>> On Mon, Apr 29, 2013 at 1:11 PM, Priya Pappachan <
>>>>>> priyapappachan010 at gmail.com> wrote:
>>>>>>
>>>>>>> Here is my updated proposal.
>>>>>>>
>>>>>>> http://wiki.smc.org.in/User:Priyapappachan/GSoC-spellchecker
>>>>>>>
>>>>>>>
>>>>>>> I read about the rules that are to be followed to handle inflections
>>>>>>> in Malayalam. I will learn them in detail to write the affix file.
>>>>>>>
>>>>>>>
>>>>>>> On Mon, Apr 29, 2013 at 10:13 AM, Santhosh Thottingal <
>>>>>>> santhosh.thottingal at gmail.com> wrote:
>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> On Sunday, April 28, 2013, Priya Pappachan wrote:
>>>>>>>>
>>>>>>>>> Hi
>>>>>>>>>
>>>>>>>>> Here is my proposal http://wiki.smc.org.in/User:Priyapappachan
>>>>>>>>>
>>>>>>>>
>>>>>>>> I have left some notes here
>>>>>>>> http://wiki.smc.org.in/User_talk:Priyapappachan/GSoC-spellchecker
>>>>>>>>
>>>>>>>> santhosh
>>>>>>>>
>>>>>>>> _______________________________________________
>>>>>>>> Swathanthra Malayalam Computing discuss Mailing List
>>>>>>>> Project: https://savannah.nongnu.org/projects/smc
>>>>>>>> Web: http://smc.org.in | IRC : #smc-project @ freenode
>>>>>>>> discuss at lists.smc.org.in
>>>>>>>> http://lists.smc.org.in/listinfo.cgi/discuss-smc.org.in
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> --
>>>>>>> Priya
>>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>> --
>>>>>> Priya
>>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> Regards,
>>>>>
>>>>> Mahesh M
>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>> Priya
>>>>
>>>
>>>
>>> --
>>> Regards,
>>>
>>> Mahesh M
>>>
>>>
>>
>>
>> --
>> Regards,
>>
>> Mahesh M
>>
>>
>
>
> --
> Priya
>


-- 

 Jahfar Ali P
 Asst.Professor
Dept. of Computer Science and Engineering
MES College of Engineering
Kuttipuram.
Mob: 07736663602

