Retrieve 3D human motion videos from text descriptions
Generate 3D human motions from text descriptions