Spaces:
Sleeping
Sleeping
| # pylint: disable=C0301 | |
| TEMPLATE_EN: str = """You are a senior VQA data engineer. Your task is to generate logically coherent, verifiable and non-hallucinated question-answer pairs for the given multi-modal samples. | |
| Use English as the output language. | |
| ---Objectives--- | |
| Create multiple sets of VQA question-answer pairs that satisfy the following: | |
| 1. Only ask about objectively existing facts in the given data, avoiding subjective or ambiguous questions. | |
| 2. Ensure that each question has a clear and verifiable answer, avoiding questions with no answer or uncertainty. | |
| 3. Questions should cover various aspects of both image and text content, ensuring diversity and comprehensiveness. | |
| 4. Avoid repetitive questions, ensuring that each question is unique and meaningful. | |
| 5. Use clear and concise language, avoiding complex or ambiguous wording. | |
| ---Instructions--- | |
| 1. Carefully analyze the provided entities and relationships to identify: | |
| - Key concepts and their hierarchical relationships | |
| - Temporal sequences and time order | |
| - Cause-and-effect relationships | |
| - Dependencies between different elements | |
| 2. Organize the information into a logical sequence by: | |
| - Starting with foundational concepts | |
| - Gradually building up to more complex relationships | |
| - Grouping related ideas together | |
| - Creating clear transitions between sections | |
| 3. Maintain the following when generating question-answer pairs: | |
| - Logical flow | |
| - Clear connections between concepts | |
| - Appropriate context and background | |
| - Coherent narrative structure | |
| 4. Review and refine the question-answer pairs to ensure: | |
| - Overall logical consistency | |
| - Clear cause-and-effect relationships | |
| ################ | |
| -Entities- | |
| ################ | |
| {entities} | |
| ################ | |
| -Relationships- | |
| ################ | |
| {relationships} | |
| ################ | |
| Directly output the generated questions and answers, please do not directly copy the example questions and answers, and do not provide irrelevant information. | |
| Here is the response format you should follow: | |
| Question: <Question1> | |
| Answer: <Answer1> | |
| Question: <Question2> | |
| Answer: <Answer2> | |
| """ | |
| TEMPLATE_ZH: str = """---角色--- | |
| 你是一位资深 VQA 数据工程师。你需要为给定的多模态样本生成逻辑连贯、可验证、无幻觉的问答对。 | |
| 使用中文作为输出语言。 | |
| ---目标--- | |
| 创建多组 VQA 问答对,满足: | |
| 1. 仅询问给定数据中客观存在的事实,避免主观或模糊的问题。 | |
| 2. 确保每个问题都有明确且可验证的答案,避免无答案或不确定的问题。 | |
| 3. 问题应涵盖图像和文本内容的各个方面,确保多样性和全面性。 | |
| 4. 避免重复问题,确保每个问题都是独特且有意义的。 | |
| 5. 使用清晰简洁的语言,避免复杂或含糊的措辞。 | |
| ---说明--- | |
| 1. 仔细分析提供的实体和关系,以识别: | |
| - 关键概念及其层级关系 | |
| - 时间序列和时间顺序 | |
| - 因果关系 | |
| - 不同元素之间的依赖关系 | |
| 2. 通过以下方式将信息组织成逻辑顺序: | |
| - 从基础概念开始 | |
| - 逐步建立更复杂的关系 | |
| - 将相关的想法分组在一起 | |
| - 在各部分之间创建清晰的过渡 | |
| 3. 生成问答对时保持: | |
| - 逻辑流畅 | |
| - 概念之间的清晰联系 | |
| - 适当的上下文和背景 | |
| - 连贯的叙述结构 | |
| 4. 检查和完善问答对以确保: | |
| - 整体逻辑一致性 | |
| - 清晰的因果关系 | |
| ################ | |
| -实体- | |
| ################ | |
| {entities} | |
| ################ | |
| -关系- | |
| ################ | |
| {relationships} | |
| ################ | |
| 直接输出生成的问题和答案,请不要直接复制示例问题和答案,不要输出无关内容。 | |
| 以下是你应该遵循的响应格式: | |
| 问题: <问题1> | |
| 答案: <答案1> | |
| 问题: <问题2> | |
| 答案: <答案2> | |
| """ | |
| VQA_GENERATION_PROMPT = {"en": TEMPLATE_EN, "zh": TEMPLATE_ZH} | |