AN ESTIMATED 80-90% OF ENTERPRISE DATA IS IN TEXT FORM.
ENTER: TEXT ANALYTICS
Text data represents 80-90% of enterprise data – and is growing at a rate of 55-65% per year. However, text analytics is a nascent area and unstructured data is difficult to synthesize and analyze without a defined data model.
In this immersive 1-day certificate workshop, you will learn actionable steps to begin leveraging your text data, with real-world case studies and code to reuse when developing your own applications.
This workshop provides a hands-on introduction to text analytics. We will use three classic business use cases as our guides: rating movies at Rotten Tomatoes, preventing bullying on Twitter, and pricing beer based on the label. Along the way, you will learn about natural language processing, feature engineering, dimension reduction, visualization, and statistical inference in Python (with NLTK, scikit-learn, and seaborn).
Our workshops are fun, personalized, held in a small-class setting, and taught by leading experts in the field. This workshop is a 1-day certificate course with a hands-on approach that ensures you'll be able to apply what you learned right away.
Joe Sutherland specializes in Computational Social Science and Text Analytics, applying techniques from computer science to questions of substantive interest in political science and economics. His current research focuses on how to study political representation with text data. Additionally, his academic and popular publications study political behavior, elections, political methodology, natural language processing, and machine learning.
Sutherland’s goal is to make the methods and software developed in the course of his doctoral research at Columbia University beneficial to as broad an audience as possible. His recently published open-source computer vision software has seen incredible adoption by researchers developing text corpora (it has been forked and starred more than 99.9% of projects on GitHub).
topics + tools
learn the basics of NLP and supervised/unsupervised approaches in machine learning
get an edge in your industry by applying text analysis in a variety of business use cases
use feature engineering, dimension reduction, visualization, and statistical inference in Python (with NLTK, scikit-learn, and seaborn)
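As a small taste of the feature engineering topic above, here is a minimal, dependency-free sketch of a bag-of-words representation, which turns each document into a row of word counts. It is illustrative only and is not taken from the workshop materials: the function names `tokenize` and `bag_of_words` and the sample reviews are invented for this example. In the workshop's stack, scikit-learn's `CountVectorizer` plays a similar role.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def bag_of_words(documents):
    """Return (vocabulary, matrix); matrix[i][j] counts vocabulary[j] in documents[i]."""
    counts = [Counter(tokenize(doc)) for doc in documents]
    # The vocabulary is every distinct token seen, in alphabetical order.
    vocabulary = sorted(set().union(*counts))
    # A Counter returns 0 for words a document does not contain.
    matrix = [[doc_counts[word] for word in vocabulary] for doc_counts in counts]
    return vocabulary, matrix


# Hypothetical movie-review snippets, invented for illustration.
reviews = [
    "A great film, great cast",
    "A dull film",
]
vocabulary, matrix = bag_of_words(reviews)
print(vocabulary)
print(matrix)
```

Each row of `matrix` is a numeric feature vector that a classifier or a dimension reduction step can consume.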
8:00 am - 9:00 am
Breakfast + Registration
9:00 am - 10:30 am
Morning Session
10:30 am - 10:45 am
Coffee + Tea Break
10:45 am - 12:00 pm
Morning Session Cont'd
12:00 pm - 1:00 pm
Lunch + Networking
1:00 pm - 2:30 pm
Afternoon Session
2:30 pm - 2:45 pm
Coffee + Tea Break
2:45 pm - 3:30 pm
Afternoon Session Cont'd
3:30 pm - 4:00 pm
Conclude + Q&A
- lesson plan coming soon! -
4:00 pm - 6:00 pm
Post-Workshop Happy Hour
- Drinks and light refreshments at South City Kitchen
Option to work remotely from Atlanta Tech Village
- Conference rooms TBD