Use of Machine Learning to Thwart Phishing Attacks

Abstract: Email-based phishing has evolved from a long-shot one-off scam to a routine, mechanized, effective practice driving over a billion dollars of wire fraud a year. This talk will detail how machine learning -- specifically anomaly detection algorithms -- can identify and flag spear phishing attempts. We'll work through specific real-world examples and detail how we identify them.


David Baggett graduated Magna Cum Laude with a B.S./B.A. in Computer Science and Linguistics from the University of Maryland in 1992. As an undergraduate, he worked extensively with Bill Pugh and Bill Gasarch in the Computer Science Department, and with Sharon Inkelas in the Linguistics Department. In addition to undergraduate work in type theory and recursion theory, David studied tonal phenomena in African languages. He received the CMPS Dean's Award for Academic Excellence in 1992.

After the University of Maryland, David entered the Ph.D. program at the MIT Artificial Intelligence Laboratory, where he studied computational linguistics with Robert Berwick under an Office of Naval Research (NDSEG) Graduate Fellowship. His 1994 master's thesis, A System for Computational Phonology included a complete implementation of autosegmental phonological theory.

In 1994, David left MIT to join video game company Naughty Dog, where he co-developed the Crash Bandicoot series for the Sony Playstation. The Crash games were worldwide best-sellers that redefined the state of the art, leading Sony to adopt Crash as its mascot character. The success and appeal of Crash significantly contributed to Sony's ability to enter the video game console market and quickly establish supremacy over former industry leaders Nintendo and Sega.

The Crash series represented a technical high water mark for the Playstation 1. As one of two developers on the first game in the series, David pioneered the use of distributed polygonal scene precomputation to vastly reduce rendering time and greatly increase scene complexity. David also introduced octree-based collision detection — an essential innovation for Crash's lush, organic environments — to the video game world. David implemented many of the level processing tools and the entire rendering pipeline, and produced the music for the first three games in the series.

David next co-founded ITA Software with two other MIT AI Lab graduates. ITA Software developed the first new airfare pricing and shopping software in decades, and licensed the technology to most major US and many international airlines. ITA Software's technology powers OrbitzKayakHipmunk, and many other travel industry websites. Google completed its purchase of ITA Software for over $700M in April, 2011. ITA Software now powers Google Flights.


David's latest venture is Inky -- a cloud-based email protection system that uses machine learning and computer vision algorithms to identify email forgeries and prevent pishing attacks. Try it!

David lives in the Washington, DC area with his wife, Catherine, and their two children — Charlie and Lizzie.

A detailed Resume/Curriculum Vitae is here.

500 W. 120th St., 918 Mudd, New York, NY 10027    212-854-2905                
©2012 Columbia University