Title
Using Machine Learning to Improve the Email Experience
Abstract
Email is an essential communication medium for billions of people, with most users relying on web-based email services. Two recent trends are changing the email experience: smartphones have become the primary tool for accessing online services including email, and machine learning has come of age. Smartphones have a number of compelling properties (they are location-aware, usually with us, and allow us to record and share photos and videos), but they also have a few limitations, notably limited screen size and small and tedious virtual keyboards. Over the past few years, Google researchers and engineers have leveraged machine learning to ameliorate these weaknesses, and in the process created novel experiences. In this talk, I will give three examples of machine learning improving the email experience. The first example describes how we are improving email search. Displaying the most relevant results as the query is being typed is particularly useful on smartphones due to the aforementioned limitations. Combining hand-crafted and machine-learned rankers is powerful, but training learned rankers requires a relevance-labeled training set. User privacy prohibits us from employing raters to produce relevance labels. Instead, we leverage implicit feedback (namely clicks) provided by the users themselves. Using click logs as training data in a learning-to-rank setting is intriguing, since there is a vast and continuous supply of fresh training data. However, the click stream is biased towards queries that receive more clicks – e.g. queries for which we already return the best result in the top-ranked position. I will summarize our work [2] on neutralizing that bias. The second example describes how we extract key information from appointment and reservation emails and surface it at the appropriate time as a reminder on the user’s smartphone. Our basic approach [3] is to learn the templates that were used to generate these emails, use these templates to extract key information such as places, dates and times, store the extracted records in a personal information store, and surface them at the right time, taking contextual information such as estimated transit time into account.
Year
DOI
Venue
2016
10.1145/2983323.2983371
ACM International Conference on Information and Knowledge Management
Keywords
Field
DocType
Email,Information Extraction,Machine Learning,Ranking
Reservation,Data mining,Display size,Computer science,Personally identifiable information,Artificial intelligence,World Wide Web,Contextual information,HTML email,Clickstream,Ranking,Information retrieval,Information extraction,Machine learning
Conference
Citations 
PageRank 
References 
2
0.36
3
Authors
1
Name
Order
Citations
PageRank
Marc A. Najork12538278.16