Visible to the public SafeAI 2023Conflict Detection Enabled

The AAAI's Workshop on Artificial Intelligence Safety

The accelerated developments in the field of Artificial Intelligence (AI) hint at the need for considering Safety as a design principle rather than an option. However, theoreticians and practitioners of AI and Safety are confronted with different levels of safety, different ethical standards and values, and different degrees of liability, that force them to examine a multitude of trade-offs and alternative solutions. These choices can only be analyzed holistically if the technological and ethical perspectives are integrated into the engineering problem, while considering both the theoretical and practical challenges of AI safety. A new and comprehensive view of AI Safety must cover a wide range of AI paradigms, including systems that are application-specific as well as those that are more general, considering potentially unanticipated risks. In this workshop, we want to explore ways to bridge short-term with long-term issues, idealistic with pragmatic solutions, operational with policy issues, and industry with academia, to build, evaluate, deploy, operate and maintain AI-based systems that are demonstrably safe.

This workshop seeks to explore new ideas on AI safety with particular focus on addressing the following questions:

  • What is the status of existing approaches in ensuring AI and Machine Learning (ML) safety, and what are the gaps?
  • How can we engineer trustable AI software architectures?
  • How can we make AI-based systems more ethically aligned?
  • What safety engineering considerations are required to develop safe human-machine interaction?
  • What AI safety considerations and experiences are relevant from industry?
  • How can we characterize or evaluate AI systems according to their potential risks and vulnerabilities?
  • How can we develop solid technical visions and new paradigms about AI Safety?
  • How do metrics of capability and generality, and the trade-offs with performance affect safety?

The main interest of the proposed workshop is to look at a new perspective of system engineering where multiple disciplines such as AI and safety engineering are viewed as a larger whole, while considering ethical and legal issues, in order to build trustable intelligent autonomy.

  • Contributions are sought in (but are not limited to) the following topics:
  • Safety in AI-based system architectures
  • Continuous V&V and predictability of AI safety properties
  • Runtime monitoring and (self-)adaptation of AI safety
  • Accountability, responsibility and liability of AI-based systems
  • Effect of uncertainty in AI safety
  • Avoiding negative side effects in AI-based systems
  • Role and effectiveness of oversight: corrigibility and interruptibility
  • Loss of values and the catastrophic forgetting problem
  • Confidence, self-esteem and the distributional shift problem
  • Safety of Artificial General Intelligence (AGI) systems and the role of generality
  • Reward hacking and training corruption
  • Self-explanation, self-criticism and the transparency problem
  • Human-machine interaction safety
  • Regulating AI-based systems: safety standards and certification
  • Human-in-the-loop and the scalable oversight problem
  • Evaluation platforms for AI safety
  • AI safety education and awareness
  • Experiences in AI-based safety-critical systems, including industrial processes, health, automotive systems, robotics, critical infrastructures, among others


Organizing Committee

  • Gabriel Pedroza, CEA LIST, France
  • Xiaowei Huang, University of Liverpool, UK
  • Xin Cynthia Chen, ETH Zurich, Switzerland
  • Andreas Theodorou, Umea University, Sweden

Steering Committee

  • Jose Hernandez-Orallo, Universitat Politecnica de Valencia, Spain
  • Mauricio Castillo-Effen, Lockheed Martin, USA
  • Richard Mallah, Future of Life Institute, USA
  • John McDermid, University of York, UK
  • Program Committee
  • Huascar Espinoza, KDT JU, Belgium
  • Stuart Russell, UC Berkeley, USA
  • Raja Chatila, Sorbonne University, France
  • Francesca Rossi, IBM and University of Padova, USA
  • Roman V. Yampolskiy, University of Louisville, USA
  • Gereon Weiss, Fraunhofer IKS, Germany
  • Roman Nagy, Argo AI, Germany
  • Nathalie Baracaldo, IBM Research, USA
  • Chokri Mraidha, CEA LIST, France
  • Brent Harrison, University of Kentucky, USA
  • Toshihiro Nakae, DENSO Corporation, Japan
  • John Favaro, Trust-IT, Italy
  • Agnes Delaborde, LNE, France
  • Jonas Nilsson, NVIDIA, USA
  • Leon Kester, TNO, The Netherlands
  • Michael Paulitsch, Intel, Germany
  • Philippa Ryan Conmy, University of York, UK
  • Stefan Kugele, Technische Hochschule Ingolstadt, Germany
  • Javier Ibanez-Guzman, Renault, France
  • Mehrdad Saadatmand, RISE SICS, Sweden
  • Alessio R. Lomuscio, Imperial College London, UK
  • Jeremie Guiochet, LAAS-CNRS, France
  • Sandhya Saisubramanian, University of Massachusetts Amherst, USA
  • Mario Gleirscher, University of Bremen, Germany
  • Chris Allsopp, Origami Labs, UK
  • Vahid Behzadan, University of New Haven, USA
  • Simos Gerasimou, University of York, UK
  • Feng Liu, Huawei Munich Research Center, Germany
  • Juliette Mattioli, Thales, France
  • Brian Tse, Affiliate at University of Oxford, China
  • Colin Paterson, University of York, UK
  • Peter Flach, University of Bristol, UK
  • Simon Fuerst, BMW Group, Germany
  • Emmanuelle Escorihuela, Airbus, France
  • Roel Dobbe, TU Delft, The Netherlands
  • Andrea Orlandini, ISTC-CNR, Italy
  • Ke Pei, Huawei, China
  • Mohamed Ibn Khedher, IRT SystemX, France
  • Ganesh Pai, NASA Ames Research Center, USA
  • Davide Bacciu, Universita di Pisa, Italy
  • Rasmus Adler, Fraunhofer IESE, Germany
  • Danilo Vasconcellos Vargas, Kyushu University, Japan
  • Vahid Hashemi, Audi, Germany
  • Umut Durak, German Aerospace Center (DLR), Germany
  • Morayo Adedjouma, CEA LIST, France
  • John Burden, University of Cambridge, UK
  • Luciano Cavalcante Siebert, TU Delft, The Netherlands
  • Timo Samann, Valeo, Germany
  • Jan Reich, Fraunhofer IESE, Germany
  • Mandar Pitale, NVIDIA, USA
  • Nikolaos Matragkas, CEA LIST, France
  • Bowei Xi, University of Purdue, USA
  • Fateh Kaakai, Thales, France
Event Date: 
Mon, 02/13/2023 - 7:00am - Tue, 02/14/2023 - 7:00pm
Event Details
Washington D.C.