Spaces:
Paused
Paused
Update README.md
Browse files
README.md
CHANGED
|
@@ -1 +1,35 @@
|
|
| 1 |
-
# LISA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# LISA: Reasoning Segmentation Via Large Language Model
|
| 2 |
+
|
| 3 |
+
This is the official implementation of ***LISA (Large-language Instructed Segmentation Assistant)***. In this work, we propose a new segmentation task --- ***reasoning segmentation***. The task is designed to output a segmentation mask given a complex and implicit query text. We establish a benchmark comprising over one thousand image-instruction pairs, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: Large-language Instructed Segmentation Assistant, which inherits the language generation capabilities of the multi-modal Large Language Model (LLM) while also possessing the ability to produce segmentation masks.
|
| 4 |
+
For more details, please refer to:
|
| 5 |
+
|
| 6 |
+
**LISA: Reasoning Segmentation Via Large Language Model [[Paper]()]** <br />
|
| 7 |
+
[Xin Lai](https://scholar.google.com/citations?user=tqNDPA4AAAAJ&hl=zh-CN),
|
| 8 |
+
[Zhuotao Tian](https://scholar.google.com/citations?user=mEjhz-IAAAAJ&hl=en),
|
| 9 |
+
[Yukang Chen](https://scholar.google.com/citations?user=6p0ygKUAAAAJ&hl=en),
|
| 10 |
+
[Yanwei Li](https://scholar.google.com/citations?user=I-UCPPcAAAAJ&hl=zh-CN),
|
| 11 |
+
[Yuhui Yuan](https://scholar.google.com/citations?user=PzyvzksAAAAJ&hl=en),
|
| 12 |
+
[Shu Liu](https://scholar.google.com.hk/citations?user=BUEDUFkAAAAJ&hl=zh-CN)
|
| 13 |
+
[Jiaya Jia](https://scholar.google.com/citations?user=XPAkzTEAAAAJ&hl=en)<br />
|
| 14 |
+
|
| 15 |
+
<p align="center"> <img src="docs/VoxelNeXt-Pipeline.png" width="100%"> </p>
|
| 16 |
+
|
| 17 |
+
### Experimental results
|
| 18 |
+
|
| 19 |
+
|
| 20 |
+
## Citation
|
| 21 |
+
If you find this project useful in your research, please consider citing:
|
| 22 |
+
|
| 23 |
+
```
|
| 24 |
+
@article{reason-seg,
|
| 25 |
+
title={LISA: Reasoning Segmentation Via Large Language Model},
|
| 26 |
+
author={Xin Lai and Zhuotao Tian and Yukang Chen and Yanwei Li and Yuhui Yuan and Shu Liu and Jiaya Jia},
|
| 27 |
+
journal={arXiv:},
|
| 28 |
+
year={2023}
|
| 29 |
+
}
|
| 30 |
+
|
| 31 |
+
```
|
| 32 |
+
|
| 33 |
+
|
| 34 |
+
## Acknowledgement
|
| 35 |
+
- This work is built upon the [LLaMA](https://github.com/facebookresearch/llama), [SAM](https://github.com/facebookresearch/segment-anything), and LLaVA(https://github.com/haotian-liu/LLaVA).
|