Previous chapters introduced general-purpose metrics for assessing synthetic data quality, focusing on statistical fidelity and machine learning utility applicable across many scenarios. However, synthetic data generation often targets specific data modalities (like images or time-series) or employs distinct generative model architectures (such as GANs or VAEs), each presenting unique evaluation challenges. General metrics may not capture the specific properties or failure modes relevant to these specialized contexts.
This chapter addresses the need for tailored evaluation techniques. You will learn about metrics designed specifically for:
By the end of this chapter, you will be equipped to select and apply appropriate specialized metrics to gain deeper insights into the quality of synthetic data generated for particular applications or by specific model types.
5.1 Evaluating Synthetic Images: FID, IS, Precision, Recall
5.2 Evaluating Synthetic Text: Perplexity, BLEU Scores
5.3 Evaluating Synthetic Time-Series Data
5.4 Metrics for GAN Evaluation
5.5 Metrics for VAE Evaluation
5.6 Hands-on practical: Calculating FID for Image Data
© 2025 ApX Machine Learning