license: mit
ConvNeXtV2-IllustrationScorer
Q0: What does this model do?
A: π This model scores your anime-style illustrations based on 4 metrics. π
Q1: What does the 4 metrics means?
A: π The 4 metrics measures the "Liking Rate", "Collection Rate", "AI-generated Probability", and "View Number / Uploaded Interval (i.e. Popularity)". π
Q2: Why the "Rate" seems not being a rate?
A: β¨ This is because the author did not train this model by regressing these "Rates". Instead, these values are obtained in a contrastive learning manner (i.e., ranking the top-k images for each "Rate"). This is because the author has observed that almost no gradient can be significantly observed by backwarding on these "Rates" if the model is trained by regressing these values. And simply, the author assumed that the model tried to minimize the Absolute Error Loss by "remembering the average value", which is not an expected result. β¨