AN ESTIMATED 80-90% OF ENTERPRISE DATA IS IN TEXT FORM.
ENTER: TEXT ANALYTICS
Text data represents an estimated 80-90% of enterprise data and is growing at a rate of 55-65% per year. However, text analytics is a nascent area, and unstructured data is difficult to synthesize and analyze without a defined data model.
In this immersive 1-day certificate workshop, you will learn actionable steps to begin leveraging your text data, with real-world case studies and code you can reuse when developing your own applications.
This workshop provides a hands-on introduction to text analytics. We will use three classic business use cases as our guides: rating movies at Rotten Tomatoes, preventing bullying on Twitter, and pricing beer based on the label. Along the way, you will learn about natural language processing, feature engineering, dimension reduction, visualization, and statistical inference in Python (with NLTK, scikit-learn, and seaborn).
Our workshops are fun, personalized, and taught in a small class setting by leading experts in the field. This workshop is a 1-day certificate course with a hands-on approach that ensures you'll be able to apply what you learn right away.
experience level:
Joe Sutherland specializes in Computational Social Science and Text Analytics, applying techniques from computer science to questions of substantive interest in political science and economics. His current research focuses on how to study political representation with text data. Additionally, his academic and popular publications study political behavior, elections, political methodology, natural language processing, and machine learning.
Sutherland’s goal is to make the methods and software developed in the course of his doctoral research at Columbia University beneficial to as broad an audience as possible. His recently published open-source computer vision software has seen incredible adoption by researchers developing text corpora (it has been forked and starred more often than 99.9% of projects on GitHub).
topics + tools
NLP basics
learn the basics of NLP and supervised/unsupervised approaches in machine learning
real-world application
get an edge in your industry by applying text analysis in a variety of business use cases
fundamentals
use feature engineering, dimension reduction, visualization, and statistical inference in Python (with NLTK, scikit-learn, and seaborn)
schedule + lesson plan
thurs, february 28th 2019 | 8 am-4 pm
workshops: 8 am-4 pm | happy hour: 4-6 pm | healthcare analytics panel: 6-9 pm
morning
8:00 am - 9:00 am
Breakfast + Registration
9:00 am - 10:30 am
Morning Session
10:30 am - 10:45 am
Coffee + Tea Break
10:45 am - 12:00 pm
Morning Session Cont'd
12:00 pm - 1:00 pm
Lunch + Networking
afternoon
1:00 pm - 2:30 pm
Afternoon Session
2:30 pm - 2:45 pm
Coffee + Tea Break
2:45 pm - 3:30 pm
Afternoon Session Cont'd
3:30 pm - 4:00 pm
Conclude + Q&A
- lesson plan coming soon! -
4:00 pm - 6:00 pm
Post-Workshop Happy Hour
- Drinks and light refreshments at South City Kitchen
Option to work remotely from Atlanta Tech Village
- Conference rooms TBD
6:00 pm - 9:00 pm
Healthcare Analytics Panel
- Dinner and open bar included
- Register for Healthcare Analytics Panel HERE
- Purchase bundled tickets for additional savings