TACIT: Text Analysis, Crawling and Interpretation Tool
TACIT is a unified, open-source architecture for gathering, managing and analyzing text. TACIT's plugin architecture has three main components:
- Crawling plugins, for automated text collection from online sources (e.g., US Senate and Supreme Court speech transcriptions, Twitter, Reddit)
- Analysis plugins, including LIWC-type word count, topic modeling, sentiment analysis, clustering and classification.
- Corpus management, for applying standard text preprocessing to prepare and store corpora.
TACIT's open-source plugin platform allows the architecture to easily adapt to today's rapid developments in text analysis. To view the full list of plugins visit: http://tacit.usc.edu/pluginsList.html