What are your thoughts on the DALL-E 3 “paper,” which doesn’t cover technical or architectural details? The only useful takeaways seem to be “higher-quality data is better” and “image captioning models that provide a great amount of detail can create good datasets.”
yeah, this paper feels more like a publicity move than actual research. if they’re not sharing the technical details, it’s not really contributing to the field. we need transparency for progress to happen.