If I had more time, I would have written shorter letter — Blaise Pascal [This is the article published on DZone: https://dzone.com/articles/sql-on-twitter-twitter-analysis-made-easy] There have been lengthy articles on analyzing Twitter data. From Cloudera: here , here , and here . More from Hortonworks here and here . This one from Couchbase is going to be short, save the examples and results. Step 1: Install Couchbase 4.5 . Use the Couchbase console create a bucket called Twitter and CREATE PRIMARY INDEX on Twitter using the query workbench or cbq shell. CREATE PRIMARY INDEX ON twitter; Step 2: Request your Twitter archive . Once you receive it, unzip it. (You can use larger twitter archives as well): cd <to the unzipped location>/data/js/tweets Step 3: $ for i in `ls` ; do grep -i -v ^Grailbir...
A blog about all things data and data processing issues and interests. SQL, NoSQL, flexible schema, scale-up, scale-out, transactions, high availability.