arXiv Analytics

Sign in

arXiv:2301.07573 [cs.LG]AbstractReferencesReviewsResources

Synthcity: facilitating innovative use cases of synthetic data in different data modalities

Zhaozhi Qian, Bogdan-Constantin Cebere, Mihaela van der Schaar

Published 2023-01-18Version 1

Synthcity is an open-source software package for innovative use cases of synthetic data in ML fairness, privacy and augmentation across diverse tabular data modalities, including static data, regular and irregular time series, data with censoring, multi-source data, composite data, and more. Synthcity provides the practitioners with a single access point to cutting edge research and tools in synthetic data. It also offers the community a playground for rapid experimentation and prototyping, a one-stop-shop for SOTA benchmarks, and an opportunity for extending research impact. The library can be accessed on GitHub (https://github.com/vanderschaarlab/synthcity) and pip (https://pypi.org/project/synthcity/). We warmly invite the community to join the development effort by providing feedback, reporting bugs, and contributing code.

Related articles: Most relevant | Search more
arXiv:2407.00116 [cs.LG] (Published 2024-06-27)
Generative AI for Synthetic Data Across Multiple Medical Modalities: A Systematic Review of Recent Developments and Challenges
arXiv:2410.08942 [cs.LG] (Published 2024-10-11)
Maximizing the Potential of Synthetic Data: Insights from Random Matrix Theory
arXiv:2406.04890 [cs.LG] (Published 2024-06-07)
Enhancing Indoor Temperature Forecasting through Synthetic Data in Low-Data Environments