I was recently trying to work on a ML project and realized that the I need very specific type of data for what I am trying to achieve which makes me wonder how do people and companies deal with data scarcity, I am pretty sure they at some point need very specific type of data which isn’t easily available
Instrumentation, user studies, being very smart about model choice (can we use an unsupervised model rather than a supervised one?)
Synthetic or partially synthetic data is also helpful, particularly for large models.