CSR SDG

2024 IEEE CSR Workshop on Synthetic Data Generation for a Cyber-Physical World (SDG)

September 2-4 – Hilton London Tower Bridge (Hybrid Event)

Synthetic datasets that reflect the statistical properties of authentic data allow to share research insights and findings without compromising privacy or proprietary interests. This approach not only promotes transparency and reproducibility in research but also encourages interdisciplinary collaboration and knowledge sharing. Artificial Intelligence (AI) is one of the main areas utilizing generated synthetic data. Privacy issues arise once the dataset contains sensitive features playing a role in training AI systems. Due to the fact that data collection is expensive and time-consuming, given some shortcomings such as low volume of data, non-compliance with regulations, and bias, we not only may achieve biased and low-performance models but also violate privacy principles. Synthetic data generation can facilitate analysis, the need for data augmentation, or prevent data breaches in highly sensitive domains, rather than weak anonymization approaches. Generative Adversarial Networks (GAN), Variational Autoencoders (VAE), and Agent-based modelling (ABM) are among the most common synthetic data generation algorithms.

However, it is critical to recognize the limitations of synthetic data generation, particularly in capturing the intricacies and interdependencies present in real-world systems. While synthetic datasets can mimic statistical distributions and patterns, they may struggle to replicate the nuanced relationships and contextual nuances inherent in complex phenomena. By leveraging advances in artificial intelligence, machine learning, and computational modelling, researchers can strive to bridge the gap between synthetic and authentic data, unlocking new opportunities for insight and innovation in fields as diverse as healthcare, finance, social sciences, and beyond

Topics of Interest

Prospective authors are encouraged to submit previously unpublished contributions from a broad range of topics, which include but are not limited to the following:

› Privacy preserving in healthcare data
› Algorithms for debiasing dataset (in the pre-processing phase of ML modelling)
› Algorithms for debiasing the ML models’ results
› Uncovering and mitigating synthetic data algorithmic bias
› Assurance and certification of the dataset and ML models

› Synergy of ABM with ML focusing on the rule extraction
› Domain dependent/independent synthetic data generation challenges and opportunities
› FAIR (findability, accessibility, interoperability, and reuse) and ethical synthetic data generation
› Explainability and interpretability aspects in synthetic data generation

Important Dates

Paper submission deadline: June 3, 2024 AoE
Authors’ notification: July 3, 2024 AoE
Camera-ready submission: July 14, 2024 AoE
Early registration deadline: July 20, 2024 AoE
Workshop date: September 2-4, 2024

Submission Guidelines

The workshop’s proceedings will be published by IEEE and will be included in IEEE Xplore. The guidelines for authors, manuscript preparation guidelines, and policies of the IEEE CSR conference are applicable to SDGCP 2024 workshop. Please visit the authors’ instructions page for more details. When submitting your manuscript via the conference management system, please make sure that the workshop’s track 2T6 SDGCP is selected in the Topic Areas drop down list.

Workshop Committees

Workshop chairs

Samira Maghool, Universita degli Studi di Milano (IT)
Faiza Allah Bukhsh, University of Twente (NL)

Workshop chairs

Ernesto Damiani, Khalifa University (AE)
Samira Maghool, Universita degli Studi di Milano (IT)
Faiza Allah Bukhsh, University of Twente (NL)

Publicity chair

Filippo Berto, Universit`a degli Studi di Milano(IT)
Samira Maghool, Universit`a degli Studi di Milano (IT)

Contact us

samira.maghool@unimi.it
f.a.bukhsh@utwente.nl

Program committee

Mirela Riveni, University of Groningen (NL)
Juba Agoun, Universit ́e Lumi`ere Lyon 2 (FR)
Valerio Bellandi, Universit`a degli Studi di Milano (IT)
Nicola Bena, Universit`a degli Studi di Milano (IT)
Filippo Berto, Universit`a degli Studi di Milano (IT)
Afshin Montakhab, Shiraz University (AE)
Marco Cremonini, Universit`a degli Studi di Milano (IT)
Elena Casiraghi, Universit`a degli Studi di Milano (IT)
Rob Bemthuis, University of Twente (NL)
Sanja Lazarova-Molnar, Karlsruhe Institute of Technology (DE)
Azzam Mourad, Lebanese American University (LB)
Anastasija Nikiforova, University of Tartu (EE)
Mohammad Reza Fani Sani, Microsoft (USA)
Paolo Caravolo, Universit`a degli Studi di Milano (IT)
Maya Daneva, University of Twente (NL)
Jeewanie Jayasinghe Arachchig, University of Twente (NL)
Rabia Maqsood, National University of Computer and Emerging Sciences (PK)
Marco Angelini, University of Rome “La Sapienza” (IT)
Alessandro Palma, University of Rome “La Sapienza” (IT)
Robert Wrembel, Poznan University of Technology (PL)
Ehsan Ullah Munir, Comsats University (PK)