Hermes is a repository of software, documentation and data for NLP. I am currently adding corpora extracted from Wikipedia (mostrly in Romance languages).
Latest 5 files |
|||
---|---|---|---|
名称 | 大小 | 日期 | 下载总数 |
setup.exe | 6.0 MB | 2013-04-27 03:06 | 10 |
gl-20120625.dict.bz2 | 2.1 MB | 2012-07-10 03:11 | 8 |
README | 2.5 KB | 2012-07-10 03:10 | 3 |
ka-20120707.sent.bz2 | 14.6 MB | 2012-07-10 02:26 | 6 |
ka-20120707.punkt.bz2 | 342.7 KB | 2012-07-10 02:19 | 10 |
全文件 |
|||
setup.exe | 6.0 MB | 2013-04-27 03:06 | 10 |
corpora | |||
gl-20120625.dict.bz2 | 2.1 MB | 2012-07-10 03:11 | 8 |
ka-20120707.sent.bz2 | 14.6 MB | 2012-07-10 02:26 | 6 |
ka-20120707.punkt.bz2 | 342.7 KB | 2012-07-10 02:19 | 10 |
it-20120608.pars.bz2 | 437.1 MB | 2012-07-09 14:27 | 6 |
ka-20120707.pars.bz2 | 14.6 MB | 2012-07-09 05:56 | 2 |
gl-20120625.lm5.bz2 | 69.5 MB | 2012-07-08 05:59 | 3 |
gl-20120625.lm3.bz2 | 46.4 MB | 2012-07-08 05:22 | 3 |
gl-20120625.tokens.bz2 | 35.8 MB | 2012-07-07 21:22 | 2 |
gl-20120625.punkt.bz2 | 3.1 MB | 2012-07-06 00:28 | 3 |
gl-20120625.sent.bz2 | 36.5 MB | 2012-07-06 00:24 | 3 |
gl-20120625.pars.bz2 | 36.6 MB | 2012-07-06 00:04 | 6 |
README | 2.5 KB | 2012-07-10 03:10 | 3 |
tools | |||
SingleLineTokens.py | 0.3 KB | 2012-07-07 22:54 | 4 |
Lowercase.py | 0.2 KB | 2012-07-07 22:54 | 4 |
README | 0.5 KB | 2012-07-07 22:54 | 10 |
WikiExtractor.py | 20.8 KB | 2012-07-06 00:55 | 245 |
TrainPunkt.py | 1.5 KB | 2012-07-06 00:55 | 6 |
RawFilter.py | 1.3 KB | 2012-07-06 00:55 | 4 |
Punkt.py | 1.1 KB | 2012-07-06 00:55 | 16 |