▶ NLP
- (LSTM)
- transformer
- BERT
- GPT
▶ Vision AI
Vision Transformer (ViT)
Generative model
- Diffusion
- GAN
Foundation model
- Segment Anything (a well-organized blog post)
- task: prompt-based model
- model: image encoder (MAE pre-trained ViT), prompt encoder (sparse, dense), mask decoder
- data: data engine
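
The three model components above can be pictured as a simple data flow: the heavy image encoder runs once per image, and the lightweight prompt encoder and mask decoder run once per prompt against the cached embedding. The snippet below is only a toy sketch of that flow, not the actual Segment Anything code. Every function here (`image_encoder`, `prompt_encoder`, `mask_decoder`, `segment`) is a hypothetical stand-in that uses plain lists instead of a ViT, and the "decoder" is a simple intensity threshold chosen purely for illustration.

```python
from typing import List, Tuple

Image = List[List[float]]   # toy H x W grayscale image
Mask = List[List[bool]]     # toy binary mask of the same shape


def image_encoder(image: Image) -> Image:
    """Hypothetical stand-in for the image encoder (a MAE pre-trained ViT in
    the outline above). Here it only min-max normalizes pixels to [0, 1]."""
    flat = [v for row in image for v in row]
    lo, hi = min(flat), max(flat)
    scale = (hi - lo) or 1.0  # avoid division by zero on constant images
    return [[(v - lo) / scale for v in row] for row in image]


def prompt_encoder(point: Tuple[int, int]) -> Tuple[int, int]:
    """Hypothetical stand-in for the sparse prompt encoder: a point prompt
    (row, col) is passed through unchanged."""
    return point


def mask_decoder(embedding: Image, prompt: Tuple[int, int], tol: float = 0.2) -> Mask:
    """Hypothetical stand-in for the mask decoder: keep pixels whose embedded
    value is within `tol` of the value at the prompted point."""
    r, c = prompt
    ref = embedding[r][c]
    return [[abs(v - ref) <= tol for v in row] for row in embedding]


def segment(image: Image, points: List[Tuple[int, int]]) -> List[Mask]:
    """Encode the image once, then decode one mask per point prompt."""
    embedding = image_encoder(image)  # expensive step, done once
    return [mask_decoder(embedding, prompt_encoder(p)) for p in points]


image = [[0, 0, 10],
         [0, 10, 10]]
masks = segment(image, [(0, 0), (1, 2)])
```

In this sketch, prompting the dark corner and the bright corner yields two complementary masks from the same cached embedding, which mirrors why a prompt-based design can answer many prompts per image cheaply.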
Other topics:
- Object Detection: R-CNN, Fast R-CNN, Faster R-CNN, YOLO, SSD
- Segmentation: FCN (fully convolutional network), U-Net, SegNet, DeepLab
- Image Captioning: LSTM + CNN; NIC (Neural Image Captioning), Show and Tell, Up-Down