[smc-discuss] 'Named Entities' labeled open data [april 2018] is now available publically.

fennecfox at openmailbox.org fennecfox at openmailbox.org
Mon Apr 30 22:54:24 PDT 2018


Hi, we have been and still collecting NER labeled contributions from the public. As part of timed publishing, all the data collected during April 2018 made public today. Feel free to download and play with it. Currently, no licenses are attached to the data. We're constantly thinking whether or not we need one to make it self-sustainable and useful to the masses in the future. Last month, we were able to tag a total of 2056 entities (under 7 categories - person, location, organization, time, date, money, percent) from 128 news articles (most of them in Malayalam, along with two or three Hindi articles). Because the data collection interface is not capable of handling spamming, we're currently notifying the trusty crowd only. We're working on it to release it to the public very soon. Thanks.

Link: https://github.com/a-mma/a-mma_NER_Open_Data


More information about the discuss mailing list