text analytics

fundamental data science & machine learning with Python + Jupyter

June 27, 2019 | Nashville, TN



Text data represents 80-90% of enterprise data – and is growing at a rate of 55-65% per year.  However, text analytics is a nascent area and unstructured data is difficult to synthesize and analyze without a defined data model.

In this immersive 1-day certificate workshop, students will learn actionable steps to began leveraging your text data, with real-world case studies and code to reuse when developing your own applications.

This workshop provides a hands-on introduction to text analytics. We will use three classic business use cases as our guides: rating movies at Rotten Tomatoes, preventing bullying on Twitter, and pricing beer based on the label. Along the way, you will learn about natural language processing, feature engineering, dimension reduction, visualization, and statistical inference in Python (with NLTK, scikit-learn, and seaborn).

Our workshops are fun and personalized in a small class setting and taught by leading experts in the field.  This workshop is 1-day certificate course with a hands-on approach that assures you'll be able to apply what you learned right away.

experience level:

Joe Sutherland specializes in Computational Social Science and Text Analytics, applying techniques from computer science to questions of substantive interest in political science and economics. His current research focuses on how to study political representation with text data. Additionally, his academic and popular publications study political behavior, elections, political methodology, natural language processing, and machine learning.

Sutherland’s goal is to make the methods and software developed in his course of doctoral research at Columbia University beneficial to as broad of an audience as possible. His recently published open-source computer vision software has seen incredible adoption by researchers developing text corpora (it has been forked and starred more than 99.9% of projects on GitHub).

topics + tools

NLP basics

learn the basics of NLP and supervised/unsupervised approaches in machine learning

real-world application

get an edge in your industry by applying text analysis in a variety of business use cases


use feature engineering, dimension reduction, visualization, and statistical inference in Python (with NLTK, scikit-learn, and seaborn)

schedule + lesson plan

thurs, february 28th 2019 | 8 am-4 pm

workshops: 8 am-4 pm  |  happy hour: 4-6 pm  |  healthcare analytics panel: 6-9 pm


8:00 am - 9:00 am

Breakfast + Registration

9:00 am - 10:30 am

Morning Session

10:30 am - 10:45 am

Coffee + Tea Break

10:45 am - 12:00 pm

Morning Session Cont'd

12:00 pm - 1:00 pm

Lunch + Networking


1:00 pm - 2:30 pm

Afternoon Session

2:30 pm - 2:45 pm

Coffee + Tea Break

2:45 pm - 3:30 pm

Afternoon Session Cont'd

3:30 pm - 4:00 pm

Conclude + Q&A

- lesson plan coming soon! -

4:00 pm - 6:00 pm

Post-Workshop Happy Hour

  • Drinks and light refreshments at South City Kitchen

Option to work remotely from Atlanta Tech Village

  • Conference rooms TBD

6:00 pm - 9:00 pm

Healthcare Analytics Panel

  • Dinner and open bar included
  • Register for Healthcare Analytics Panel HERE
    • Purchase bundled tickets for additional savings

venue + parking

location TBA soon!