Topic: Applying Interactive Weak Supervision to NLP Tasks
Speaker: Shayan Mohanty, CEO and Co-Founder of Watchful
Bio: Shayan has spent over a decade leading data engineering teams at various companies, including Facebook, where he served as lead for the stream processing team responsible for processing 100% of the ads metrics data for all FB products. He is also a Guest Scientist at Los Alamos National Laboratory and has given talks on topics ranging from Automata Theory to Machine Teaching.
Abstract: Weakly supervised approaches have gained popularity in the last two years, but there is still a significant amount of overhead in applying these methods to more complex NLP tasks. The performance of weakly supervised systems is contingent on both the quality and quantity of independent sources of weak signals: if a practitioner cannot come up with sufficient sources themselves, then weak supervision is largely impractical.
To overcome this, we can use techniques to interactively generate candidate sources of weak supervision to guide the practitioner, making weak supervision practical for many tasks that would otherwise be difficult to support. In this tutorial, we'll first build a basic weakly supervised system for an NLP task, and then augment it with some of these generative techniques to speed up the iterative process.
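As a rough illustration of the two ideas in the abstract, here is a minimal sketch in plain Python. It is not the tutorial's code and does not use Watchful's product or any weak supervision library. The sentiment task, the keyword rules, and the names `weak_label` and `suggest_keywords` are all assumptions made up for this example. Several noisy rules vote on a label, and a simple co-occurrence count proposes new candidate rules for the practitioner to review.

```python
from collections import Counter

ABSTAIN, NEGATIVE, POSITIVE = -1, 0, 1

# Hypothetical labeling functions: each is a cheap, noisy source of weak signal.
def lf_contains_great(text):
    return POSITIVE if "great" in text.lower() else ABSTAIN

def lf_contains_terrible(text):
    return NEGATIVE if "terrible" in text.lower() else ABSTAIN

def lf_ends_with_exclamation(text):
    return POSITIVE if text.rstrip().endswith("!") else ABSTAIN

LABELING_FUNCTIONS = [lf_contains_great, lf_contains_terrible, lf_ends_with_exclamation]

def weak_label(text, lfs=LABELING_FUNCTIONS):
    """Combine labeling function votes by majority; abstain on no votes or a tie."""
    votes = [v for v in (lf(text) for lf in lfs) if v != ABSTAIN]
    if not votes:
        return ABSTAIN
    (top, top_n), *rest = Counter(votes).most_common()
    if rest and rest[0][1] == top_n:
        return ABSTAIN
    return top

def suggest_keywords(texts, min_count=2):
    """Propose candidate keyword rules: tokens seen in at least `min_count`
    documents weakly labeled with one class and in none labeled with the other."""
    by_class = {POSITIVE: Counter(), NEGATIVE: Counter()}
    for text in texts:
        label = weak_label(text)
        if label != ABSTAIN:
            by_class[label].update(set(text.lower().split()))
    suggestions = []
    for label, other in ((POSITIVE, NEGATIVE), (NEGATIVE, POSITIVE)):
        for token, n in by_class[label].items():
            if n >= min_count and by_class[other][token] == 0:
                suggestions.append((token, label, n))
    # Most frequent candidates first, ties broken alphabetically.
    return sorted(suggestions, key=lambda s: (-s[2], s[0]))

if __name__ == "__main__":
    reviews = [
        "Great service, loved it",
        "great food, loved the staff",
        "terrible wait",
        "Terrible food",
    ]
    for token, label, n in suggest_keywords(reviews):
        print(token, label, n)
```

In this sketch, a suggested token such as "loved" would be shown to the practitioner, who decides whether to accept it as a new labeling function. Real systems typically replace the majority vote with a learned model of each source's accuracy.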
ODSC brings together the open-source and data science communities with the goal of helping its members learn, connect and grow.
The focus of this Meetup group is to allow ODSC to work with Meetup groups, non-profits, and other organizations to present informative lectures, workshops, code sprints, and networking events that help grow the use of open-source languages and tools within the data science and data-centric community. As such, our specific goals are:
1. Build a collaborative group to work with other Meetup groups, non-profits, and other organizations.
2. Promote the use of open-source languages and tools amongst data scientists and others.
3. Host educational workshops.
4. Spread awareness of new open-source languages and tools that can be used in data science.
5. Contribute back to the open-source community.
Who is this meetup for?
• Data engineers, analysts, scientists, and other practitioners
• Software engineers using R, Python, or other languages who work with data or want to learn
• Data visualization developers and designers
• Non-technical team leads, executives, and other decision-makers from data-centric startups and large companies looking to utilize open-source tools