'Explainable AI' 카테고리의 글 목록

Explainable AI

[ECCV 2014] Visualizing and Understanding Convolutional Networks (DeConvolution)

2025.04.21

오늘은 deconvolution에 대해 다뤄보려고 한다. 편하게 그냥 deconv라고 하겠다. 이름에서부터 알 수 있듯이 CNN 속의 과정을 탐구하는 방법론이며, 이미지를 받아서 줄이고 줄이고 줄여가는 CNN과 반대로 우리가 관심이 있는 어떤 단계의 줄여진 친구를 다시 늘리고 늘리고 늘려가면서 최종적으로 히트맵을 출력, 특정 단계에서의 CNN이 어떤 부분에 집중하고 있는지를 알 수 있는 방법론이다. 논문 원문은 다음과 같다.https://arxiv.org/abs/1311.2901 Visualizing and Understanding Convolutional NetworksLarge Convolutional Network models have recently demonstrated impressive c..

Explainable AI

[논문리뷰] A Modified Perturbed Sampling Method for Local Interpretable Model-agnostic Explanation (MPS-LIME)

2025.04.18

LIME을 같이 다뤘었는데, (블로그 이전 글 참조) LIME의 slight modification인 MPS-LIME이 재미있어 보여서 가지고 왔다. 원문링크:https://arxiv.org/abs/2002.07434 A Modified Perturbed Sampling Method for Local Interpretable Model-agnostic ExplanationExplainability is a gateway between Artificial Intelligence and society as the current popular deep learning models are generally weak in explaining the reasoning process and prediction res..

Explainable AI

[ICML 2018] Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV)

2025.04.11

여태까지는 Local explanation에 해당하는 XAI 기법들만 다뤄보았다. Activation Maximization, Saliency Map Visualization, Layer-wise Relevance BackPropagation, LIME(Local Interpretable model-Agnostic Explanations), SHAP(SHapley Additive exPlanations) 모두 Local explanation 기법이다. 지금부터는 Global explanation에 해당하는 XAI 기법들을 다뤄볼 건데, 그 시작은 바로 TCAV(Testing with Concept Activation Vectors)이다. 참고로, Local explanation 기법들은 하나의 데이터..

Explainable AI

[NIPS 2017] A Unified Approach to Interpreting Model Predictions (SHAP)

2025.04.08

오늘은 XAI framework 중 하나인 SHAP에 대해 다뤄볼 것이다. SHAP는 LIME 이후에 나온 이론적 framework로, 여태까지 이 블로그에서 다루었던 LIME, LRP 뿐만 아니라, DeepLIFT, 그리고 다른 Shapeley-valued method들을 모두 통합하는 하나의 framework이다. 그래서 논문 이름도 "A Unified Approach to Interpreting Model Predictions"이다. 위의 방법론들을 통합하는 하나의 framework이기 때문이다. 대략적으로 어떤 느낌의 방법론이냐면, LIME에서도 그랬듯이 복잡한 문제를 local의 범위에서 linear function으로 간략화하여 다루는 방법론의 큰 틀을 제시했다고 보면 될 것 같다. 원본 ..

Explainable AI

[KDD 2016] "Why Should I Trust You?": Explaining the Predictions of Any Classifier (LIME)

2025.04.01

오늘은 XAI 분야에서 유명한 논문 중 하나인, LIME에 대해서 다뤄보고자 한다. LIME은 Local Interpretable Model-Agnostic Explanations의 줄임말이다. 제목을 보면 이 모델의 특징을 잘 알 수 있는데, 1. Local한 부분을 이용한다.2. Model-Agnostic 하다. 라는 큰 특징 2가지가 보인다. 원문은 다음과 같다(Ribeiro et al., 2016).https://arxiv.org/abs/1602.04938 "Why Should I Trust You?": Explaining the Predictions of Any ClassifierDespite widespread adoption, machine learning models remain mos..

Explainable AI

[논문리뷰] On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation (LRP)

2025.03.26

이전에 다뤘던 Activation Maximization, Saliency Map 등은 각 hidden layer의 feature map을 시각화하여 설명가능한 AI를 구현하는 방식이다. 하지만 이는 깊은 layer일수록 해석이 힘들고, 사람마다 해석이 다를 수 있으며, 약간 애매모하다는 단점이 존재한다. 이번에 다룰 LRP는 HeatMap 방식의 설명을 제공하는 방법론 중 대표적인 방법론이며, 모델의 결과를 역추적해서 입력 이미지에 HeatMap을 생성하는 방식이다. https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0130140 On Pixel-Wise Explanations for Non-Linear Classifier Deci..

Explainable AI

[ICML 2017 Workshop] SmoothGrad: Removing noise by adding noise

2025.03.26

이번에는 저번 논문리뷰인 saliency map을 조금 더 발전시킨, SmoothGrad라는 기법에 대해 알아보려고 한다. https://arxiv.org/abs/1706.03825 SmoothGrad: removing noise by adding noiseExplaining the output of a deep network remains a challenge. In the case of an image classifier, one type of explanation is to identify pixels that strongly influence the final decision. A starting point for this strategy is the gradient of the class sc..

Explainable AI

[ICLR 2014 Workshop] Deep Inside Convolutional Networks: Visualizing Image Classification Models and Saliency Maps (Saliency Map)

2025.03.26

이번에는 Saliency Map Visualisation을 통해서 시각화를 하는 방법을 알아볼 것이다."Deep Inside Convolutional Networks: Visualizing Image Classification Models and Saliency Maps" 라는 논문이다. 논문 원문은 https://arxiv.org/abs/1312.6034 Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency MapsThis paper addresses the visualisation of image classification models, learnt using deep Convolutional N..

Explainable AI

[논문리뷰] Activation Maximization

2025.03.25

AI model들은 항상 black-box라는 문제를 가지고 있다. 이 문제로 인해 모델 안에서 어떤 일들이 벌어지고 있는지를 직접 시각화하고 알아내고자 하는 시도들이 많이 있었는데, Activation Maximization (Maximization by optimization) 도 그 시도들 중에 하나이다. 특히 Activation Maximization은 CNN에서 각 Feature Map들이 표현하는 시각적 특징이 무엇인지를 시각화하려는 시도이다. CNN의 구조 중 하나를 "target"으로 고정시켜놓고, 이를 최대로 활성화시키는 입력 이미지를 찾거나 생성하는 방법이다. (여기서는 생성하는 방식을 다루도록 하겠다)참조: - Olah, et al., "Feature Visualization", D..

Explainable AI

[NeurIPS 2021] Neural Additive Models: Interpretable Machine Learning with Neural Nets

2024.09.05

Interpretable Modeling 관련하여 찾아던 중 Neural Additive Models: Interpretable Machine Learning with Neural Nets 이라는 논문을 찾아 간단하게 공부하게 되었다. (Spotlight (Top 3%) at NeurIPS 2021) Deep Neural Networks(DNN)이 black-box이기 때문에 healthcare, finance, criminal justice와 같이 고위험군 task에서는 적용가능성에 문제점이 있기에 더욱 더 interpretation 가능한 방법론이 필요하다고 말한다. 기존의 방법론인 LIME과 같은 방법론들은 모델의 behavior에 대한 설명 디테일이 부족하거나, fail to provide a..

티스토리툴바