'논문 리뷰' 태그의 글 목록

논문 리뷰

[Paper Review] Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks 논문 리뷰 2024.02.21
[Paper Review] Stable Bias: Evaluating Societal Representations in Diffusion Models 논문 리뷰 2024.02.03
[Paper Review] No Token Left Behind: Explainability-Aided Image Classification and Generation 논문 리뷰 2024.01.23 1

[Paper Review] Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks 논문 리뷰

jasonlee1995 2024. 2. 21. 00:07

2024. 2. 21. 00:07

Paper Info

Accepted on ICLR 2024 Spotlight
Authors: Hao Chen, Jindong Wang, Ankit Shah, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj
Affiliation: Carnegie Mellon University, Microsoft Research Asia, SusTech, RIKEN AIP, The University of Tokyo, Mohamed bin Zayed University of AI
arXiv link: https://arxiv.org/abs/2309.17002
OpenReview link: https://openreview.net/forum?id=TjhUtloBZU
Task: pre-training dataset의 label noise로 인해 생기는 pre-trained model의 downstream task에서의 성능 저하를 mitigate
- Observation: pre-training의 slight label noise는 in-domain에 도움이 되지만, out-of-domain에는 악영향을 끼침
- Analysis: SVE가 적당히 커서 slight label noise가 in-domain에 좋으며, label noise가 커질수록 LSVR이 커지기에 out-of-domain에 악영향을 끼침
- Mitigation: SVE가 커지도록 + LSVR이 작아지는 loss로 MLP를 학습하여, pre-training label noise의 downstream task에 대한 negative impact를 mitigate
TLDR: singular value spectrum을 이용하여 observation 분석 + 이를 이용한 loss로 mitigation

1. Observation

Pre-trained foundation model을 downstream tasks에 fine-tuning하는 pre-training and fine-tuning (PT-FT) 방식이 de-facto standard가 되었음

Large-scale pre-training dataset에는 web에서 수집한 데이터를 포함하고 있기에, label noise가 존재할 수 밖에 없음

Pre-training data의 label noise가 pre-trained model의 downstream tasks performance에 어떠한 영향을 미치는지에 대한 연구는 존재하지 않았는데, 이를 연구한 첫 논문

Proper noisy labels in pre training (e.g., 5% or 10%) can benefit the performance on ID downstream tasks, while more noise results in inferior results
The robustness of transferability on OOD downstream tasks constantly deteriorates as the noise increases, even with the improvement in ID tasks on 5% noise

즉, pre-training dataset에서 slight label noise는 ID에 도움을 주지만 OOD에는 안좋다는 counter-intuitive한 observation

2. Analysis

Downstream dataset의 pre-trained feature에 대한 singular value spectrum을 이용하여, 관측한 현상을 empirically analyze
(각 downstream task의 entire test set에 대해 singular value spectrum을 구함)

결론만 말하자면 singular value spectrum을 이용하여 구한 SVE, LSVR로 ID, OOD performance를 각각 해석

Singular Value Entropy (SVE) definition from paper

SVE는 singular value distribution의 flatness를 측정
(SVE가 클수록 singular value distribution은 flat)

SVE가 크다는 의미 : feature space가 data의 structure를 더 잘 capture함
(이는 discriminated features 때문일 수도 있고, noise memorization 때문일 수 있음)

Largest Singular Value Ratio (LSVR) definition from paper

LSVR은 largest singular value가 singular values sum 중 차지하는 비율을 측정
(LSVR이 클수록 largest singular value가 차지하는 비율이 작아짐)

LSVR이 크다는 의미 : largest singular value에 해당하는 singular vector가 data variation을 잘 capture하지 못함

기존 연구 중, largest singular value에 해당하는 eigenvector가 feature transferability를 dominate함을 발견함

이를 통해, LSVR이 크다면 feature transferability가 낮다라고 말할 수 있음

SVE & ID tasks
pre-training noise가 커질수록 SVE가 커짐
왜 pre-training dataset의 slight noise가 ID에 도움이 되는가? → slight noise의 SVE가 clean의 SVE보다 커서
data내의 noise를 학습하다보니 feature space의 dimension이 span하게 되는데, 이로 인해 성능이 좋은 것
물론 noise ratio가 증가하게 되면 noisy data structure을 capture하고 memorize하기에, 성능이 감소하게 됨
LSVR & OOD tasks
pre-training noise가 커질수록 LSVR이 커짐
왜 pre-training dataset의 label noise가 OOD에 악영향을 주는가? → LSVR이 커지기에
즉, noise ratio가 커질수록 less transferable components가 학습되어 unseen OOD tasks에서 성능이 안좋은 것

3. Mitigation

Overview of Noisy Model Learning from paper

배경 : foundation model과 같은 large pre-trained model을 full fine-tuning하는 것은 비용이 너무 비쌈
목적 : pre-training에서의 noise가 OOD에서의 성능을 악화시키는 malicious effect를 mitigate하고 싶음

배경과 목적을 고려하여, MLP를 학습하여 pre-trained feature F를 new feature space Z로 transform하여 mitigation

Analysis를 통해 얻은 insight를 이용하여 loss를 설계하고, 이를 이용하여 MLP 학습
(insight : 다양한 feature를 배워야하며, LSVR이 작아야함)

consistency regularization: pre-trained knowledge를 잊지 않고 유지하게끔 하는 loss

covariance regularization: 모델이 다양한 feature를 배우도록 하는 loss
(Barlow Twins, VICReg에서 사용했던 방식, SVE가 커지도록)

dominant singular value regularization from paper

dominant singular value regularization: LSVR을 directly maximize하는 loss

위 3가지 regularization을 합한 NMTune loss + CE loss로 학습하면 pre-training label noise로 인한 negative impact를 mitigation할 수 있다라는 것

Vision, language에서도 NMTune이 효과적임을 보임

4. Conclusions

The author's conclusions

Limitation: linear probing이 NMTune보다 성능이 좋은 경우인 failure case가 존재

저자들은 이에 대해 top-K singular values를 optimize하는 SVD regularization을 사용해야하는데, largest singular value만 optimize했기 때문이라고 추측

top-K에서의 K의 optimal value는 dataset마다 다를텐데, K=1이 다양한 tasks에서 보편적으로 좋은 performance를 보이기에 그냥 사용했다고 적혀있음

My Conclusion

Pre-training에서의 slight label noise가 ID downstream 성능에 도움이 되며, OOD downstream 성능에 해가 된다는 observation은 매우 흥미로움

그러나 limitation에도 언급했듯이, NMTune이 LP보다 성능이 안좋은 failure case가 있기에 올바른 metric으로 분석했는가?에 대한 의문이 남아있음

그럼에도 불구하고 새로운 분야 개척 + 흥미로운 observation이라는 조합은 다양한 생각할 거리들을 제공하기에 가치가 있다고 생각함

Rating

Good

'논문 리뷰 > AI' 카테고리의 다른 글

[Paper Review] Denoising Diffusion Probabilistic Models 논문 리뷰 (0)	2024.05.26
[Paper Review] Texture Synthesis Using Convolutional Neural Networks 논문 리뷰 (0)	2024.05.08
[Paper Review] Spuriosity Rankings: Sorting Data to Measure and Mitigate Biases 논문 리뷰 (2)	2024.03.06
[Paper Review] Stable Bias: Evaluating Societal Representations in Diffusion Models 논문 리뷰 (0)	2024.02.03

[Paper Review] Stable Bias: Evaluating Societal Representations in Diffusion Models 논문 리뷰

jasonlee1995 2024. 2. 3. 20:26

2024. 2. 3. 20:26

Paper Info

Accepted on NeurIPS Datasets and Benchmarks 2023 Spotlight
Authors: Alexandra Sasha Luccioni, Christopher Akiki, Margaret Mitchell, Yacine Jernite
Affiliation: Hugging Face, Leipzig University, ScaDS.AI
arXiv link: https://arxiv.org/abs/2303.11408
OpenReview link: https://openreview.net/forum?id=qVXYU3F017
Task: TTI system의 social bias identification
TLDR: Profession dataset에 대해 annotator-free method를 이용하여 gender, ethnicity bias identification

1. Brief Summary

기존 연구들은 binary gender, fixed prior ethnicity에 대한 classification을 통해 TTI system의 social bias를 identify했음

이러한 classification 기반의 social bias identification 방법론들은 2가지 문제를 가짐

trans와 같은 기존에 없던 attribute에 대한 bias identification을 위해, classification 모델을 새로 학습해야함
학습한 classification 모델이 완벽하지 않음

이를 극복하기 위해, 저자들은 다양한 attributes에 대해 flexible하게 social bias identification할 수 있는 방법론을 제시함

TTI system의 특성을 고려하여 text modality, image modality 측면에서 social bias를 분석하는 방법을 제안하며, AI에 대한 이해도가 부족한 사람도 social bias를 분석할 수 있도록 툴을 제공

stable-bias (Stable Bias)

Stable Bias: Analyzing Societal Representations in Diffusion Models As machine learning-enabled Text-to-Image (TTI) systems are becoming increasingly prevalent and seeing growing adoption as commercial services, characterizing the social biases they exhibi

huggingface.co

2. Methodology: Auditing Social Biases in TTI Systems

Overall concept
1. Bias 평가 기준인 identity characteristic을 정의 (e.g. gender, ethnicity)
2. Identity attributes를 이용하여 input prompts 생성 + TTI system을 이용한 image generation
3. Bias가 있는지 확인할 social characteristic을 정의 (e.g. profession)
4. Social attributes를 이용하여 input prompts 생성 + TTI system을 이용한 image generation
5. TTI system의 특성을 고려하여 text modality, image modality 측면에서 분석

2.1. Generating a Dataset of Identity Characteristics and Social Attributes

Identity dataset
bias 평가 기준이 될 social attribute으로 gender, ethnicity 선택
gender는 trans를 제외하고는 US context 반영, ethnicity는 North American context 반영
4 gender x 17 ethnicity → total 68 combinations
각 prompt당 30 images 생성 → total 2,040 images
Profession dataset
bias가 있는지 audit할 target attribute으로 profession 선택
U.S. Bureau of Labor Statistics (BLS) 반영
total 146 professions
List of attributes used for prompting

list of identity, profession attributes from paper

2.2. Different Approaches for Analyzing Generated Images

TTI의 multi-modal nature을 고려하여 text modality, image modality 측면에서 social bias 분석 + in-depth exploration을 할 수 있는 interactive tools 제공

2.2.1. Text Features Analysis: Image-to-Text Systems

Image captioning, VQA 모델을 이용하여 Profession dataset의 images에 대한 text descriptions를 생성
(ViT GPT-2 for image captioning, BLIP VQA for Visual Question Answering)

Image captioning을 통해 multi-word captions를, VQA model + question을 통해 single word or short phrase를 얻음
(VQA question: "What word best describes this person's appearance?")

구한 image captions, VQA answers에 gender-marked words가 포함됐을 likelihood를 이용하여 social bias 분석

(man, woman → gender-marked words / person, profession name → gender-unspecified descriptors)

2.2.2. Visual Features Analysis: Clustering-Based Approach

Overall concept : identity dataset의 image embedding을 clustering한 후, profession dataset의 image embedding이 어떤 cluster에 속하는지를 이용하여 social bias를 evaluate

Obtaining image embedding using VQA model

VQA 모델 + question을 이용하여 image embedding을 구함
(BLIP VQA with question "What word best describes this person's appearance?",

the normalized average of the question token embeddings produced by the VQA encoder conditioned on the image)

Person에 집중한 image embedding을 얻기 위해 CLIP image encoder를 사용하지 않고 VQA 모델 사용

Identity dataset에 대해 image embedding을 구한 뒤, 24 regions로 clustering
24 regions를 사용한 이유 : interpretability와 discriminative를 적당히 모두 만족하는 optimal number라서
(optimal number of clusters in terms of distinctiveness and interpretability of the analysis)
Image를 생성했던 prompt를 이용하여, 각 region을 대표하는 gender, ethnicity를 파악 (top-2 gender, top-4 ethnicity)
Profession dataset에 대해 image embedding을 구한 뒤, 어떤 region에 해당하는지 파악
각 profession에 해당하는 이미지들이 어떤 regions에 속하는지를 이용하여 social bias 파악

2.2.3. Interactive Exploration

Ad-hoc in-depth exploration을 할 수 있도록, 다양한 interactive tools 제공
(quantitative insights를 제공하려는 목적 X)

3. Results

3.1. Gender Bias Analysis through Text Markers

Overall concept : profession dataset에 대해 생성한 text descriptions를 이용하여 gender bias 분석

identifying social bias of TTI system using text modality from paper

BLS-provided numbers와 비교했을 때, gender bias가 가장 큰건 DALL-E 2, 가장 작은건 Stable Diffusion v1.4임

Image captions의 97.66%가 gender-marked terms를 포함하고 있는 반면, VQA answers는 45.56%만 포함하고 있음

대부분의 image captions는 full sentences인 반면, 대부분의 VQA answers는 single word prediction이기에 그런것

참고로 gender-neutral terms는 거의 없었으며, non-binary gender marker는 아예 없었음

Example - professions with large discrepancy

Caption, VQA 모두 고려했을 때 discrepancy가 가장 컸던 professions는 다음과 같음

BLS보다 text description에 women 비율이 더 적은 profession

즉, women을 더 적게 생성한 profession
clerk (57/55% less), data entry keyer (55/53% less), real estate broker (52/54% less)

BLS보다 text description에 women 비율이 더 많은 profession

즉, women을 더 많이 생성한 profession
singer (29/36% more), cleaner (20/16% more), dispatcher (19/16% more)

Markedness

Markedness - Wikipedia

From Wikipedia, the free encyclopedia State of standing out as unusual or difficult in comparison to a more common or regular form In linguistics and social sciences, markedness is the state of standing out as nontypical or divergent as opposed to regular

en.wikipedia.org

Markedness: 다른 것들과 구분되는 특징을 가진 것

사람이 image를 labeling한다고 하면, image에서 특징적인 것을 기준으로 text labeling하게 됨

Image captions, VQA answers에 person과 같은 gender-neutral terms이 거의 등장하지 않은 이유를 markedness로 이해할 수 있음

3.2. Gender and Ethnicity Distribution in the Image Space

3.2.1. Characterizing Identity Regions in the Image Space

identity clusters (regions) example from paper

24개의 regions에 대해, top-2 gender & top-4 ethnicity를 이용하여 각 region의 overall identity trend를 파악

Profession dataset을 24 regions에 대해 clustering하여, 전반적인 trend를 rough하게 파악할 수 있음
(e.g. Table 2의 share을 보면 알 수 있듯이, Profession dataset 중 40%가 White man이며 woman은 25.5%밖에 안됨)

identifying social bias on specific job from paper

Figure 2, 3와 같이, 특정 profession에 대한 social bias가 어떤지 + TTI system별 차이를 확인할 수 있음

3.2.2. Gender and Ethnicity Representation across Systems

Method 1

BLS에서 gender, ethnicity를 기준으로 jobs를 rank
(woman for gender, Black for ethnicity)
Jobs를 5 bins로 group을 나눔
각 group에 속하는 jobs에 대해 BLS를 이용하여 woman, Black의 비율을 측정
각 group에 속하는 jobs에 대해 Profession dataset의 images들의 woman, Black의 비율을 측정
(group내의 images가 woman이 top-2 gender인 region에 있는지, Black이 top-4 ethnicity인 region에 있는지의 비율)
Profession dataset에 대한 woman, Black의 비율과 BLS에서의 woman, Black 비율을 비교

위에서 설명했던 방법으로는 TTI system의 social bias가 얼마나 심한지, 그리고 TTI systems간 비교하기 어려움

Method 1을 이용하여, TTI systems의 general bias trends를 수치로 표현하여 Table 3과 같이 비교할 수 있음

Stable Diffusion v1.4이 US distribution과 차이가 가장 적고, DALL-E 2가 가장 큼

11 TTI systems initialized from pre-trained Stable Diffusion from paper

저자들은 추가적으로 HuggingFace Hub에서 가장 많이 다운로드된 11 TTI models에 대해서도 수치를 측정

11 TTI models 모두 pre-trained Stable Diffusion model로 initialize했음에도 불구하고, specific fine-tuning, adaptation process에 따라 social bias의 diversity가 다름

3.3. Interactive Tools for Interactive Exploration

Diffusion Bias Explorer (Figure 5 (a))
prompt에 대한 TTI system의 결과를 보여주어, bias tendency를 눈으로 확인 가
Average Face Comparison Tool (Figure 5 (b))
python package Facer를 이용하여 face detection & alignment를 이용하여 profession별 facial images를 average
facial recognition, classification techniques 없이 생성된 images에 대한 high-level patterns 확인 가능
Nearest Neighbors Explorers: BoVW and Colorfulness (Figure 5 (c))
2가지 방식의 nearest-neighbor lookup tools를 이용하여 생성된 image에 대한 structured exploration
color를 반영한 colorfulness, structural similarity를 반영한 bag-of-visual-words TF-IDF index를 이용
해당 방법들은 external pre-training dataset에 depend하지 않음