Data-Driven Cyberphysical Systems Provably Correct Control in Data Rich/Labels Scarce Scenarios

pdf

The overall goal of this project is to develop theory, methods and tools necessary to take advantage of data in the design, certiﬁcation, manufacturing and operation of cyberphysical systems. The central question we ask is “How can we, in a data-rich world, design and operate cyberphysical systems differently?” Data-driven techniques will help transform the design process of cyberphysical systems into one in which data and models—and human designers and operators—continuously and ﬂuently interact. This poster addresses the portion of this research carried out at Northeastern University, addressing the issue of provably correct decisions for CPS in scenarios with rich run-time data, but where labeling is expensive and off-line training may not capture rare but potentially catastrophic events. Speciﬁcally, we address the following challenges:

Obtaining space-spanning data, specially for situations involving potentially unsafe operations
Real-time labeling with certiﬁable performance, even for data previously unseen
Synthesizing non-conservative, provably stabilizing control laws

Leveraging Simulations to Handle Scarce Labels: The goal here is to obtain data that spans the space of feasible trajectories by exploiting the ability of simulators to generate data cheaply, while avoiding costly tuning to match the simulator to the actual physical system. We propose to accomplish this by recasting the problem into a Domain Adaptation form, where the goal is to ﬁnd a transformation that maps the simulation data (source domain) to the actual CPS data (target domain). This problem is solved by jointly ﬁnding a transformation that aligns the covariances in both domains and a classiﬁer that optimizes classiﬁcation accuracy on the combined adapted source and target data. In turn, this leads to an optimization over a Stiefel manifold that can be efﬁciently solved using recently developed polynomial optimization techniques.

Efﬁcient Data Labeling: While existing state-of-the-art labeling and classiﬁcation methods are very suc-cessful, they rely on computationally intensive training requiring very large amounts of training data. How-ever, such a training is not always feasible for CPS operating in uncertain environments or subject to failure modes previously unseen. As an alternative, we propose the use of data-driven, statistical classiﬁers. The main idea here is to construct sum-of-squares polynomials that approximate the support set of the data for different operating regimes. These SoS polynomials can be directly computed from the empirical moment matrices of the data, at the price of a single singular value decomposition, and new data can be incrementally accommodated by simply updating these matrices. Preliminary results show that for the speciﬁc case of 3D printing, SoS based classiﬁers substantially outperform traditional ones.

Non Conservative Data Driven Control Synthesis: The objective of this portion of the research is to synthesize provably stabilizing control laws for cases where an a-priori model of the system is unavailable. Traditionally, this situation is handled through a multi-step process: (i) systems identiﬁcation, (ii) model (in)validation, and (iii) robust control synthesis. However, this process is computationally involved and the resulting controllers are potentially conservative. As an alternative, we have developed a data-driven approach where the controller is identiﬁed directly from the data, by solving a polynomial optimization problem. The resulting controllers are non-conservative and guaranteed to stabilize all plants compatible with existing side-information and experimental data.

Tags:

Posters (Sessions 8 & 11)

License: CC-2.5

Submitted by Mario Sznaier on Tue, 01/22/2019 - 15:36