Xin Lai committed on
Commit · c48aae5
1 Parent(s): 968fffb
Update README.md
Former-commit-id: 76f1879bbaec5efb00e2aa71c98bc7359a86aa4c
README.md
CHANGED
@@ -18,7 +18,7 @@
 - [ ] ReasonSeg Dataset Release
 - [ ] Training Code Release
 
-**LISA: Reasoning Segmentation Via Large Language Model [[Paper](https://arxiv.org/
+**LISA: Reasoning Segmentation Via Large Language Model [[Paper](https://arxiv.org/abs/2308.00692)]** <br />
 [Xin Lai](https://scholar.google.com/citations?user=tqNDPA4AAAAJ&hl=zh-CN),
 [Zhuotao Tian](https://scholar.google.com/citations?user=mEjhz-IAAAAJ&hl=en),
 [Yukang Chen](https://scholar.google.com/citations?user=6p0ygKUAAAAJ&hl=en),
@@ -29,7 +29,7 @@
 
 ## Abstract
 In this work, we propose a new segmentation task --- ***reasoning segmentation***. The task is designed to output a segmentation mask given a complex and implicit query text. We establish a benchmark comprising over one thousand image-instruction pairs, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: Large-language Instructed Segmentation Assistant, which inherits the language generation capabilities of the multi-modal Large Language Model (LLM) while also possessing the ability to produce segmentation masks.
-For more details, please refer to
+For more details, please refer to the [paper](https://arxiv.org/abs/2308.00692).
 
 ## Highlights
 **LISA** unlocks the new segmentation capabilities of multi-modal LLMs, and can handle cases involving: