CLMS Career Exploration Talk
Oct 9, 2012
About Me
- BA, Linguistics from UW in 2008
- Graduated from CLMA in 2010
- Currently works as a software developer at Substantial
Overview
- Kiha Software
- Aro Mobile, Kiha's product
- Aro and NLP
- Hackathon!
- What was most useful from CLMA
- What I learned about NLP at Kiha
- Questions?
- Founded in 2008
- Launched Aro Mobile in 2010
- I joined in Sept, 2010
Aro Mobile
- Service and personal information manager (PIM) for Android that:
- Understands your data
- Makes you faster and more efficient at managing email, contacts, calendar
Aro Mobile -- how does it work?
You give Aro your email credentials
Aro reads all your email
Aro keep models of who/what is important to you
Aro make it easy to communicate with who you want
Aro Mobile -- how does it work?
Hi Josh,
Could you give Mark a call at 3pm tomorrow?
Thanks,
Bob
Bob the Builder
206 123 1234
bob@bigbricks.com
1337 Pro Builder Lane
Seattle, WA 98122
Aro Mobile's Email Client
Aro and NLP
- Extracted entities:
- Names, places, businesses
- Phone numbers
- Tracking numbers (USPS, UPS, Fedex)
- Flight numbers
- Email signatures
- Calendar events
- Addresses
- Email addresses
Aro and NLP
- Models
- CRF trained IOB sequence labeler
- Minorthird
- Data
- Newswire corpora
- Annotated email (in-house and Mechanical Turk)
What I did at Kiha
- Automating model training through continuous integration with Maven
- Feature engineering for NER models (POS tags, NP chunks)
- Improving performance, while maintaining precision/recall
Hackathon!
- Extending EES: recognizing conference call meeting numbers
- Hacking the Android app to auto-dial conference calls
Hi Bob,
Dora the Explorer invites you to attend this online meeting.
Topic: Computational Linguistics
Date: Monday, January 10, 2013
Time: 2:15 pm, Pacific Standard Time (San Francisco, GMT-08:00)
Meeting Number: 736 771 237
CCP:+16504293300x736771237#
What was most useful from CLMA
- 570, 572
- i2b2 Medical Extraction Challenge
- Experience with MALLET, libSVM
- Familiarity with what algorithms and techniques are available
What I learned about NLP at Kiha
- Use existing libaries -- most of my time was spent thinking about features (not algorithms)
- Sometimes boring techniques are good enough (e.g. name recognition from list of names)
- Learn to have a sense of what "good enough" performance is
Final words
- Kiha has since rebranded and changed their product direction
- For current job seekers: Hacker News Meetup, Seattle.rb, Beer && Code