<Session_Outline/>

Text classification is a ubiquitous capability with a wealth of use cases. While dozens of techniques now exist for this fundamental task, many of them require massive amounts of labeled data to be useful. Collecting annotations for your use case, however, is typically one of the most costly parts of any machine learning application. In this talk, I’ll explain how text representations (embeddings) can be leveraged as classifiers, trained with only a small amount of labeled data, or even with no labeled data at all. I’ll also give a demo of this method in action.
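
As a concrete illustration of the zero-label case, the sketch below classifies text by embedding both the input and plain-language label names with a sentence embedding model, then picking the most similar label by cosine similarity. The model name, label set, and example text are placeholder assumptions, not the talk's actual setup.

```python
# Minimal sketch: zero-shot text classification with sentence embeddings.
# Assumes the sentence-transformers package; labels and model are illustrative.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")  # any sentence embedding model could work

# With no labeled examples, describe each class in natural language
# and classify by embedding similarity to those descriptions.
labels = ["sports", "politics", "technology"]
label_embeddings = model.encode(labels, convert_to_tensor=True)

def classify(text: str) -> str:
    text_embedding = model.encode(text, convert_to_tensor=True)
    scores = util.cos_sim(text_embedding, label_embeddings)  # shape (1, num_labels)
    return labels[int(scores.argmax())]

print(classify("The quarterback threw for 300 yards in last night's game."))
# likely prints "sports"
```

With a small amount of labeled data, the same idea extends to few-shot classification by embedding the labeled examples and comparing new inputs against them (or a lightweight classifier trained on top of the embeddings).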

<Key_Takeaways/>

  • Learn about paradigms and strategies for working with limited labeled data
  • Understand how popular text embedding models (SentenceBERT, Word2Vec) can be used as classifiers
  • See a prototype demonstration via a simple Streamlit application (a rough sketch follows this list)
  • Gain insight into the strengths and limitations of text embeddings as classifiers
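
Below is a rough sketch of how such an embedding-based classifier might be wrapped in a simple Streamlit application, along the lines of the prototype mentioned above. The labels, model, and layout are illustrative assumptions, not the demo's actual code.

```python
# Illustrative Streamlit wrapper around the embedding-based classifier.
import streamlit as st
from sentence_transformers import SentenceTransformer, util

LABELS = ["sports", "politics", "technology"]  # hypothetical label set

@st.cache_resource  # load the embedding model once, not on every rerun
def load_model():
    return SentenceTransformer("all-MiniLM-L6-v2")

model = load_model()

st.title("Text classification with embeddings")
text = st.text_area("Enter some text to classify")

if text:
    # Cosine similarity between the input text and each label description.
    scores = util.cos_sim(
        model.encode(text, convert_to_tensor=True),
        model.encode(LABELS, convert_to_tensor=True),
    )[0]
    st.write({label: round(float(score), 3) for label, score in zip(LABELS, scores)})
```

Saved as a file such as app.py, this can be launched locally with `streamlit run app.py`.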

————————————————————————————————————————————————————

<Speaker_Bio/>

Melanie Beck – Machine Learning Research Engineer | Cloudera

Melanie Beck is a Research Engineer at Cloudera Fast Forward, where she delights in translating machine learning breakthroughs into practical applications and is particularly interested in natural language processing. With experience in machine learning and data science at diverse organizations – from manufacturing to cybersecurity – she is a jack-of-all-trades problem solver and a reformed astrophysicist, holding a PhD in Astrophysics from the University of Minnesota.

May 26 @ 13:00 – 13:30 (30′)

Day 2 | 19th of May – Machine Learning
