Collaborative Research: EAGER: CPS: Data Augmentation and Model Transfer for the Internet of Things
Nick Feamster
Lead PI:
Nick Feamster
Abstract
This project is advancing the field of anomaly detection within the realm network management for the Internet of Things (IoT). This research field, which is also known as ?novelty detection,? is critical for identifying unusual events in network traffic, ranging from security breaches to hardware failures. Unfortunately, various technical gaps have hampered widespread adoption of these techniques. A significant barrier to development and ultimate adoption is the lack of labeled data in IoT environments necessary for training effective machine learning models.<br/><br/>To address this problem, this project is developing techniques to improve novelty detection models through the generation of labeled datasets in IoT settings. The project aims to address these gaps through data augmentation and novelty transfer to increase the availability of labeled data. Leveraging available data from the IoT laboratory at the University of Chicago and emerging synthetic traffic trace generation capabilities, this project is creating a comprehensive dataset comprising network traffic, as well as various multi-modal data from various IoT devices. This dataset will be labeled with diverse activities and features, including device information, user activity, and instances of novelty such as network attacks or physical breaches. This project is also developing techniques to assign confidence to labels and transfer models from controlled laboratory settings to real-world deployments, a process known as novelty transfer. This involves developing robust machine learning methods capable of handling deviations from learned traffic patterns, particularly in unbalanced datasets.<br/><br/>The project combines system-oriented activities with advancements in machine learning techniques. Broader impacts include the development and release of a public, open-source software library for machine learning on network traffic data, as well as educational initiatives aimed at promoting machine learning for networking through both in-person and online courses, textbooks, and other<br/>educational material.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Nick Feamster
Nick Feamster is Neubauer Professor of Computer Science and the Director of Research at the Data Science Institute at the University of Chicago. Previously, he was a full professor in the Computer Science Department at Princeton University, where he directed the Center for Information Technology Policy (CITP); prior to Princeton, he was a full professor in the School of Computer Science at Georgia Tech. His research focuses on many aspects of computer networking and networked systems, with a focus on network operations, network security, and censorship-resistant communication systems. He received his Ph.D. in Computer science from MIT in 2005, and his S.B. and M.Eng. degrees in Electrical Engineering and Computer Science from MIT in 2000 and 2001, respectively. He was an early-stage employee at Looksmart (acquired by AltaVista), where he wrote the company's first web crawler; and at Damballa, where he helped design the company's first botnet-detection algorithm. Nick is an ACM Fellow. He received the Presidential Early Career Award for Scientists and Engineers (PECASE) for his contributions to cybersecurity, notably spam filtering. His other honors include the Technology Review 35 "Top Young Innovators Under 35" award, the ACM SIGCOMM Rising Star Award, a Sloan Research Fellowship, the NSF CAREER award, the IBM Faculty Fellowship, the IRTF Applied Networking Research Prize, and award papers at ACM SIGCOMM (network-level behavior of spammers), the SIGCOMM Internet Measurement Conference (measuring Web performance bottlenecks), and award papers at USENIX Security (circumventing web censorship using Infranet, web cookie analysis) and USENIX Networked Systems Design and Implementation (fault detection in router configuration, software-defined networking). His seminal work on the Routing Control Platform won the USENIX Test of Time Award for its influence on Software Defined Networking.
Performance Period: 05/01/2024 - 04/30/2025
Award Number: 2334996