license: apache-2.0

PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model

Features

  • A powerful extension of the Large Multi-modal Model for generic (panoptic, instance, semantic) segmentation, referring segmentation, and interactive segmentation.
  • Supports joint training across multiple segmentation tasks and visual-language tasks.
  • Demonstrates zero-shot capabilities on unseen tasks, such as open-vocabulary segmentation, generalized referring segmentation, and video object segmentation.

Note

Before loading the model, change mm_vision_tower to the path of your Mask2Former checkpoint.
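
One way to make this change is with a short script instead of editing the file by hand. The sketch below assumes that mm_vision_tower is a top-level key in a JSON config file shipped with the model (as in LLaVA-style checkpoints); this README does not say where the setting lives, so verify that first. The helper name set_vision_tower is hypothetical and not part of PSALM.

```python
import json
from pathlib import Path


def set_vision_tower(config_path, checkpoint_path):
    """Rewrite the `mm_vision_tower` entry of a JSON config file.

    Assumption: the config is a JSON object with `mm_vision_tower`
    as a top-level key. All other keys are left unchanged.
    """
    config_file = Path(config_path)
    config = json.loads(config_file.read_text(encoding="utf-8"))

    # Point the vision tower at the local Mask2Former checkpoint.
    config["mm_vision_tower"] = str(checkpoint_path)

    config_file.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

A call would look like set_vision_tower("path/to/config.json", "path/to/mask2former_checkpoint"), where both paths are placeholders for your local files.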