SuKIIII2 (LuoHe)

upvoted a paper 3 months ago

MachineLearningLM: Continued Pretraining Language Models on Millions of Synthetic Tabular Prediction Tasks Scales In-Context ML

Paper • 2509.06806 • Published Sep 8 • 63

upvoted 2 papers 6 months ago

VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos

Paper • 2505.23693 • Published May 29 • 54

Table-R1: Inference-Time Scaling for Table Reasoning

Paper • 2505.23621 • Published May 29 • 93

upvoted a paper 9 months ago

Taking Notes Brings Focus? Towards Multi-Turn Multimodal Dialogue Learning

Paper • 2503.07002 • Published Mar 10 • 39

upvoted 16 papers 10 months ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published Feb 13 • 191

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published Jan 24 • 32

Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with Prototypical Embedding

Paper • 2401.15708 • Published Jan 28, 2024 • 12

From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities

Paper • 2401.15071 • Published Jan 26, 2024 • 37

Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI

Paper • 2401.14019 • Published Jan 25, 2024 • 23

Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All

Paper • 2401.13795 • Published Jan 24, 2024 • 68

Rethinking Patch Dependence for Masked Autoencoders

Paper • 2401.14391 • Published Jan 25, 2024 • 26

CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion

Paper • 2401.14066 • Published Jan 25, 2024 • 11

LuoHe

AI & ML interests

Organizations

MachineLearningLM: Continued Pretraining Language Models on Millions of Synthetic Tabular Prediction Tasks Scales In-Context ML

VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos

Table-R1: Inference-Time Scaling for Table Reasoning

Taking Notes Brings Focus? Towards Multi-Turn Multimodal Dialogue Learning

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with Prototypical Embedding

StableIdentity: Inserting Anybody into Anywhere at First Sight

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling

Generative Expressive Robot Behaviors using Large Language Models

TIP-Editor: An Accurate 3D Editor Following Both Text-Prompts And Image-Prompts

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Learning Universal Predictors

From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities

Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI

Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All

Rethinking Patch Dependence for Masked Autoencoders

CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion

LuoHe

AI & ML interests

Organizations

SuKIIII2's activity