Pieter Abbeel received a BS/MS in Electrical Engineering from KU Leuven (Belgium) and received his Ph.D. degree in Computer Science from Stanford University in 2008. He joined the faculty at UC Berkeley in Fall 2008, with an appointment in the Department of Electrical Engineering and Computer Sciences. He has won various awards, including best paper awards at ICML and ICRA, the Sloan Fellowship, the Air Force Office of Scientific Research Young Investigator Program (AFOSR-YIP) award, the Okawa Foundation award, the 2011's TR35, the IEEE Robotics and Automation Society (RAS) Early Career Award, and the Dick Volz Best U.S. Ph.D. Thesis in Robotics and Automation Award. He has developed apprenticeship learning algorithms which have enabled advanced helicopter aerobatics, including maneuvers such as tic-tocs, chaos and auto-rotation, which only exceptional human pilots can perform. His group has also enabled the first end-to-end completion of reliably picking up a crumpled laundry article and folding it. His work has been featured in many popular press outlets, including BBC, New York Times, MIT Technology Review, Discovery Channel, SmartPlanet and Wired. His current research focuses on robotics and machine learning with a particular focus on challenges in personal robotics, surgical robotics and connectomics.
Abstract
This objective of this proposal is to improve the management of the air traffic system, a cyber-physical system where the need for a tight connection between the computational algorithms and the physical system is critical to safe, reliable and efficient performance. The approach is based on an adaptive multiagent coordination algorithm with a particular emphasis on the systematic selection of the agents, their actions and the agents' reward functions. The intellectual merit lies in addressing the agent coordination problem in a physical setting by shifting the focus from "how to learn" to "what to learn." This paradigm shift allows a separation between the learning algorithms used by agents, and the reward functions used to tie those learning systems into system performance. By exploring agent reward functions that implicitly model agent interactions based on feedback from the real world, this work aims to build cyber-physical systems where an agent that learns to optimize its own reward leads to the optimization of the system objective function. The broader impact is in providing new air traffic flow management algorithms that will significantly reduce air traffic congestion. The potential impact cannot only be measured in currency ($41B loss in 2007) but in terms of improved experience by all travelers, providing a significant benefit to society. In addition, the PIs will use this project to train graduate and undergraduate students (i) by developing new courses in multiagent learning for transportation systems; and (ii) by providing summer internship opportunities at NASA Ames Research Center.
Performance Period: 09/01/2009 - 08/31/2014
Institution: Oregon State University
Sponsor: National Science Foundation
Award Number: 0931591
Abstract
The objective of this research is to develop new scientific and engineering principles, algorithms and models for the design of battery powered cyber-physical systems whose computational substrates include high-performance multiprocessor systems-on-chip. The approach is to design control tasks that guarantee performance and meet criteria for battery operation time. Task schedulers are co-designed to balance the computing load across the multiple processors, and to control the physical plant together with the control tasks. The controller and scheduler will be integrated with battery management algorithms through a systems theory approach so that the methods are provably correct with justfiable performance. Intellectual Merit: The program will create progress in digital and hybrid control theory that keeps up with the recent trend of using multiprocessor systems-on-chips for control and robotic applications. The mechanism for the migration of control tasks between multiple processors will respect physical and thermal performance. A novel battery dynamic discharge model is developed, which may be applied to context when the discharge current of batteries cannot be predicted by existing static battery models. Broader Impacts: Collaborations with industrial partners have been set up. The program offers multidisciplinary training in cyber-physical systems. A teaching and outreach lab is in place to host K-12 student teams that participate in robot competitions, and has become an Explorer Post for Boy Scout of America.
Performance Period: 09/01/2009 - 08/31/2013
Institution: GA Tech Research Corporation - GA Institute of Technology
Sponsor: National Science Foundation
Award Number: 0931576
Abstract
The objective of this research is to develop technologies to improve the efficiency and safety of the road transportation infrastructure. The approach is to develop location-based vehicular services combining on-board automotive computers, in-car devices, mobile phones, and roadside monitoring/surveillance systems. The resulting vehicular Cyber Physical Systems (CPS) can reduce travel times with smart routing, save fuel and reduce carbon emissions by determining greener routes and commute times, improve safety by detecting road hazards, change driving behavior using smart tolling, and enable measurement-based insurance plans that incentivize good driving. This research develops distributed algorithms for predictive travel delay modeling, feedback-based routing, and road hazard assessment. It develops privacy-preserving protocols for capturing and analyzing data and using it for tasks such as congestion-aware tolling. It also develops a secure macro-tasking software run-time substrate to ensure that algorithms can be programmed centrally without explicitly programming each node separately, while ensuring that it is safe to run third-party code. The research focuses on re-usable methods that can benefit multiple vehicular services, and investigates which lessons learned from this vehicular CPS effort generalize to other situations. Road transportation is a grand challenge problem for modern society, which this research can help overcome. Automobile vendors, component developers, and municipal authorities have all shown interest in deployment. The education plan includes outreach to local K-12 students and a new undergraduate course on transportation from a CPS perspective, which will involve term projects using the data collected in the project
Performance Period: 10/01/2009 - 09/30/2014
Institution: Massachusetts Institute of Technology
Sponsor: National Science Foundation
Award Number: 0931550
Abstract
The objective of this research is to create computational foundation, methods, and tools for efficient and autonomous optical micromanipulation using microsphere ensembles as grippers. The envisioned system will utilize a holographic optical tweezer, which uses multiple focused optical traps to position microspheres in three-dimensional space. The proposed approach will focus on the following areas. First, it will provide an experimentally validated optical-tweezers based workstation for concurrent manipulation of multiple cells. Second, it will provide algorithms for on-line monitoring of workspace to support autonomous manipulation. Finally, it will provide real-time image-guided motion planning strategies for transporting microspheres ensembles. The proposed work will lead to a new way of autonomously manipulating difficult-to-trap or sensitive objects using microspheres ensembles as reconfigurable grippers. The proposed work will also lead to fundamental advances in several cyber physical systems areas by providing new approaches to micromanipulations, fast and accurate algorithms with known uncertainty bounds for on-line monitoring of moving microscale objects, and real-time motion planning algorithms to transport particle ensembles. The ability to quickly and accurately manipulate individual cells with minimal training will enable researchers to conduct basic research at the cellular scale. Control over cell-cell interactions will enable unprecedented insights into cell signaling pathways and open up new avenues for medical diagnosis and treatment. The proposed integration of research with education will train students with a strong background in emerging robotics technologies and the inner workings of cells. These students will be in a unique position to rapidly develop and deploy specialized robotics technologies.
Performance Period: 09/01/2009 - 08/31/2014
Institution: University of Maryland College Park
Sponsor: National Science Foundation
Award Number: 0931508
Abstract
The physical environment of a cyber-physical system is unboundedly complex, changing continuously in time and space. An embodied cyber-physical system, embedded in the physical world, will receive a high bandwidth stream of sensory information, and may have multiple effectors with continuous control signals. In addition to dynamic change in the world, the properties of the cyber-physical system itself ? its sensors and effectors ? change over time. How can it cope with this complexity? The hypothesis behind this proposal is that a successful cyber-physical system will need to be a learning agent, learning the properties of its sensors, effectors, and environment from its own experience, and adapting over time. Inspired by human developmental learning, the assertion is that foundational concepts such as Space, Object, Action, etc., are essential for such a learning agent to abstract and control the complexity of its world. To bridge the gap between continuous interaction with the physical environment, and discrete symbolic descriptions that support effective planning, the agent will need multiple representations for these foundational domains, linked by abstraction relations. To achieve this, the team is developing the Object Semantic Hierarchy (OSH), which shows how a learning agent can create a hierarchy of representations for objects it interacts with. The OSH shows how the ?object abstraction? factors the uncertainty in the sensor stream into object models and object trajectories. These object models then support the creation of action models, abstracting from low-level motor signals. To ensure generality across cyber-physical systems, these methods make only very generic assumptions about the nature of the sensors, effectors, and environment. However, to provide a physical test bed for rapid evaluation and refinement of our methods, the team has designed a model laboratory robotic system to be built from off-the-shelf components, including a stereo camera, a pan-tilt-translate base, and a manipulator arm. For dissemination and replication of research results, the core system will be affordable and easily duplicated at other labs. There are plans to distribute the plans, the control software, and the software for experiments, to encourage other labs to replicate and extend the work. The same system will serve as a platform for an open-ended set of undergraduate laboratory tasks, ranging from classroom exercises, to term projects, to independent study projects. There is a preliminary design for a very inexpensive version of the model cyberphysical system that can be constructed from servo motors and pan-tilt webcams, for use in collaborating high schools and middle schools, to communicate the breadth and excitement of STEM research.
Performance Period: 09/01/2009 - 08/31/2014
Institution: University of Michigan Ann Arbor
Sponsor: National Science Foundation
Award Number: 0931474
Abstract
The objective of this research is to enable operation of synthetic and cyborg insects in complicated environments, such as outdoors or in a collapsed building. As the mobile platforms and environment have significant uncertainty, learning and adaptation capabilities are critical. The approach consists of three main thrusts to enable the desired learning and adaptation: (i) Development of algorithms to efficiently learn optimal control policies and dynamics models through sharing the learning and adaptation between various instantiations of platforms and environments. (ii) Creation of control learning algorithms which can be run on low-cost, low-power mobile platforms. (iii) Development of algorithms for online improvement of policy performance in a minimal number of real-world trials. The proposed research will advance learning and adaptation capabilities of practical cyberphysical systems. The proposed approach will be generally applicable and lead to a new class of learning and adapting systems that are able to leverage shared properties between multiple tasks to significantly speed up learning and adaptation. Success in this research project will bring society closer to solving the grand challenge of teams of mobile, disposable, search and rescue robots which can robustly locomote through uncertain and novel environments, finding survivors in disaster situations, while removing risk from rescuers. This project will provide interdisciplinary training through research and classwork for undergraduate and graduate students in creating systems which intimately couple the cyber and physical aspects in robotic and living mobile platforms. Through the SUPERB summer program, under-represented students in engineering will experience research in learning and robotics.
Pieter Abbeel
Performance Period: 09/01/2009 - 08/31/2013
Institution: University of California-Berkeley
Sponsor: National Science Foundation
Award Number: 0931463
Abstract
The objective of this research is to study the formal design and verification of advanced vehicle dynamics control systems. The approach is to consider the vehicle-driver-road system as a cyber-physical system (CPS) by focusing on three critical components: (i) the tire-road interaction; (ii) the driver-vehicle interaction; and (iii) the controller design and validation. Methods for quantifying and estimating the uncertainty of the road friction coefficient by using self-powered wireless sensors embedded in the tire are developed for considering tire-road interaction. Tools for real-time identification of nominal driver behavior and uncertainty bounds by using in-vehicle cameras and body wireless sensors are developed for considering driver-vehicle interaction. A predictive hybrid supervisory control scheme will guarantee that the vehicle performs safely for all possible uncertainty levels. In particular, for controller design and validation, the CPS autonomy level is continuously adapted as a function of human and environment conditions and their uncertainty bounds quantified by considering tire-road and driver-vehicle interaction. High confidence is critical in all human operated and supervised cyber-physical systems. These include environmental monitoring, telesurgery, power networks, and any transportation CPS. When human and environment uncertainty bounds can be predicted, safety can be robustly guaranteed by a proper controller design and validation. This avoids lengthy and expensive trial and error design procedures and drastically increases their confidence level. Graduate, undergraduate and underrepresented engineering students benefit from this project through classroom instruction, involvement in the research and substantial interaction with industrial partners from the fields of tires, vehicle active safety, and wireless sensors.
Performance Period: 09/01/2009 - 08/31/2012
Institution: University of California-Berkeley
Sponsor: National Science Foundation
Award Number: 0931437
Abstract
The objective of this research is the transformation from static sensing into mobile, actuated sensing in dynamic environments, with a focus on sensing in tidally forced rivers. The approach is to develop inverse modeling techniques to sense the environment, coordination algorithms to distribute sensors spatially, and software that uses the sensed environmental data to enable these coordination algorithms to adapt to new sensed conditions. This work relies on the concurrent sensing of the environment and actuation of those sensors based on sensed data. Sensing the environment is approached as a two-layer optimization problem. Since mobile sensors in dynamic environments may move even when not actuated, sensor coordination and actuation algorithms must maintain connectivity for the sensors while ensuring those sensors are appropriately located. The algorithms and software developed consider the time scales of the sensed environment, as well as the motion capabilities of the mobile sensors. This closes the loop from sensing of the environment to actuation of the devices that perform that sensing. This work is addresses a challenging problem: the management of clean water resources. Tidally forced rivers are critical elements in the water supply for millions of Californians. By involving students from underrepresented groups, this research provides a valuable opportunity for students to develop an interest in engineering and to learn first hand about the role of science and engineering in addressing environmental issues.
Performance Period: 09/01/2009 - 08/31/2012
Institution: University of California-Berkeley
Sponsor: National Science Foundation
Award Number: 0931348
Abstract
The objective of this research is to develop the scientific foundation for the quantitative analysis and design of control networks. Control networks are wireless substrates for industrial automation control, such as the WirelessHART and Honeywell's OneWireless, and have fundamental differences over their sensor network counterparts as they also include actuation and the physical dynamics. The approach of the project focuses on understanding cross-cutting interfaces between computing systems, control systems, sensor networks, and wireless communications using time-triggered architectures. The intellectual merit of this research is based on using time-triggered communication and computation as a unifying abstraction for understanding control networks. Time-triggered architectures enable the natural integration of communication, computation, and physical aspects of control networks as switched control systems. The time-triggered abstraction will serve for addressing the following interrelated themes: Optimal Schedules via Quantitative Automata, Quantitative Analysis and Design of Control Networks: Wireless Protocols for Optimal Control: Quantitative Trust Management for Control Networks. Various components of this research will be integrated into the PIs' RAVEN control network which is compatible with both WirelessHART and OneWireless specifications. This provides a direct path for this proposal to have immediate industrial impact. In order to increase the broader impact of this project, this project will launch the creation of a Masters' program in Embedded Systems, one of the first in the nation. The principle that guides the curriculum development of this novel program is a unified systems view of computing, communication, and control systems.
Performance Period: 09/01/2009 - 08/31/2014
Institution: University of Pennsylvania
Sponsor: National Science Foundation
Award Number: 0931239
Abstract
The objective of this research is to develop algorithms for wireless sensor-actuator networks (WSAN) that allow control applications and network servers to work together in maximizing control application performance subject to hard real-time service constraints. The approach is a model-based approach in which the WSAN is unfolded into a real-time fabric that captures the interaction between the network's cyber-processes and the application's physical-processes. The project's approach faces a number of challenges when they are applied to wireless control systems. This project addresses these challenges by 1) using network calculus concepts to pose a network utility maximization (NUM) problem that maximizes overall application performance subject to network capacity constraints, 2) using event-triggered message passing schemes to reduce communication overhead, 3) using nonlinear analysis methods to more precisely characterize the problem's utility functions, and 4) using anytime control concepts to assure robustness over wide variations in network connectivity. The project's impact will be broadened through interactions with industrial partner, EmNet LLC. The company will use this project's algorithms on its CSOnet system. CSOnet is a WSAN controlling combined-sewer overflows (CSO), an environmental problem faced by nearly 800 cities in the United States. The project's impact will also be broadened through educational outreach activities that develop a graduate level course on formal methods in cyber-physical systems. The project's impact will be broadened further through collaborations with colleagues working on networked control systems under the European Union's WIDE project.
Michael Lemmon
Michael Lemmon is a professor of electrical engineering at the University of Notre Dame. He received his PhD and MS in EE from Carnegie-Mellon University in 1987 and 1990, respectively. He got his BSEE from Stanford University in 1979. He was an aerospace engineering from 1979-1986. He joined the faculty of electrical engineering at Notre Dame in 1990. His early research was on neural network, hybrid and cyber-physical systems, wireless sensor-actuator networks, and networked control systems. He is currently studying deep learning for robust adaptive control.
Performance Period: 09/01/2009 - 08/31/2014
Institution: University of Notre Dame
Sponsor: National Science Foundation
Award Number: 0931195