Governing AI Data: Real vs. Synthetic Integrity Challenges