Corpus analysis software program
Is there any software program that i can use for analyzing language corpora other than English corpora? I tried EM editor but it doesn't seem to work.
Is there any software program that i can use for analyzing language corpora other than English corpora? I tried EM editor but it doesn't seem to work.
See the answer here:
http://askaling.linguistlist.org/ques...
for example AntConc should be useful, but also general Unix command line tools can allow you to process or analyze language corpora for other languages, assuming that you are talking about other scripts or text encodings (e.g. some Unicode standard). Otherwise, you can always code simple tools in Python or Java yourself. You might want to look into NLTK for Python, which can process various file formats and Unicode encoded texts.
DC
If you're just looking for something like EmEditor, i.e. a text editor for Windows that can support large file sizes, there are many out there. I'd try out the software suggested at these pages:
Many of the listed programs are free, and most should have full Unicode support (hence be usable with non-English corpora).
I personally use Notepad++, and a colleague of mine swears by Crimson Editor, but I'm not sure if either of them support files as large as you need them to.
Asked: 2015-04-28 19:10:51 -0400
Seen: 449 times
Last updated: May 01 '15