Notes for Big Data Class December 1, 2019

This will be both a paper for the Law Center and a module of my Big Data class. The plan is to take in a bunch of Word documents (Supreme Court decisions) as input, and then subject them to a series of automated textual analyses: classification, clustering, and sentiment analysis. Possibly a couple of visualizations as well.


I'm going to try this out with KNIME, which is turning out to be an easy-to-use (translation: non-STEM undergraduate friendly, no coding) data analytics platform. So far, I was able to bang out a workflow to ingest text files and turn them into a data structure. Next step is to join that to another data structure that maps words to sentiments.
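For anyone who wants to see what those two steps amount to outside KNIME, here is a rough Python sketch. It is not the KNIME workflow itself, just my guess at the same idea: turn text into word counts, then look each word up in a word-to-sentiment table. The lexicon and the sample sentence are made up for illustration, and the function names are my own.

```python
import re
from collections import Counter

# Hypothetical toy lexicon (an assumption for illustration only).
# A real sentiment lexicon would have thousands of scored entries.
SENTIMENT_LEXICON = {
    "fair": 1,
    "just": 1,
    "reason": 1,
    "void": -1,
    "unlawful": -1,
    "invalid": -1,
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def word_counts(text):
    """Step 1: turn raw text into a data structure (word -> frequency)."""
    return Counter(tokenize(text))


def sentiment_score(counts, lexicon=SENTIMENT_LEXICON):
    """Step 2: map words to sentiments and sum the weighted scores.

    Words missing from the lexicon count as neutral (0).
    """
    return sum(lexicon.get(word, 0) * n for word, n in counts.items())


# Made-up sample text, not a quotation from any decision.
sample = "The appointment is void. The ruling is fair and just."
counts = word_counts(sample)
score = sentiment_score(counts)
print(counts.most_common(3))
print("sentiment score:", score)
```

A positive score means more positive than negative lexicon words turned up, and a negative score means the reverse. It is a crude measure, but it is the same word-lookup idea as the next step in the workflow.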

I wonder what will happen if I feed Republic v. Sereno into a neural network, and then plug that into a chatbot. Will it be a rambling misogynist? Or the very model of reason?