This project explores spatial reasoning with large language models (LLMs) over 3D scene graphs to enable high-level task planning from natural language instructions. Previous works, such as SayPlan [1] and SayCan [2], have demonstrated the potential of LLMs in robotic planning, but they rely on proprietary frontier models such as GPT-4, which are computationally expensive and require internet access. In this work, I investigate fine-tuning open-source LLMs (Llama 3, 1B/3B/8B [3]) for task planning over structured scene representations. I build my own 3D scene graph task-planning system to assess alternative inference strategies, such as single-shot and iterative multi-prompting, and to generate training data for SFT and PPO fine-tuning. The goal is to determine whether a fine-tuned, locally deployable model can achieve planning performance comparable to or exceeding that of general-purpose frontier LLMs while improving efficiency. I present experimental results evaluating the impact of representation choices, prompting techniques, and fine-tuning configurations on plan quality and execution feasibility.
Check LangSpace_reasoning.pdf for more details.
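
As a rough illustration of the single-shot strategy, the sketch below shows how a small 3D scene graph might be serialized into a planning prompt for a locally deployed Llama 3 model. The graph schema, node names, action set, and prompt wording here are illustrative assumptions, not the exact formats used in this project (see LangSpace_reasoning.pdf for those).

```python
import json

# Hypothetical toy scene graph: rooms, objects, and the robot's state.
# The schema is an illustrative assumption, not the project's actual format.
scene_graph = {
    "rooms": ["kitchen", "living_room"],
    "objects": [
        {"name": "apple", "room": "kitchen", "on": "counter"},
        {"name": "counter", "room": "kitchen"},
        {"name": "table", "room": "living_room"},
    ],
    "robot": {"room": "living_room", "holding": None},
}

def build_single_shot_prompt(graph: dict, instruction: str) -> str:
    """Flatten the scene graph into a single prompt that asks for a full plan."""
    return (
        "You are a robot task planner. The environment is described by this "
        "3D scene graph (JSON):\n"
        f"{json.dumps(graph, indent=2)}\n\n"
        f"Instruction: {instruction}\n"
        "Respond with a numbered list of primitive actions "
        "(goto(room), pick(object), place(object, surface))."
    )

if __name__ == "__main__":
    prompt = build_single_shot_prompt(
        scene_graph, "Bring the apple to the living room table."
    )
    print(prompt)  # This string would be sent to the local Llama 3 model.
```

In the iterative multi-prompting variant, the same serialization would instead be fed to the model one planning step at a time, with the scene graph (and robot state) updated between prompts.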