Pointer-CAD teaser

Pointer-CAD vs. traditional CAD command sequences

Traditional sequence-based representations cannot directly reference geometric entities like edges or faces and are prone to discretization errors. Pointer-CAD instead uses edge pointers to directly operate on B-rep geometry, enabling precise operations such as sketch snapping, reducing quantization artifacts, and better following complex text instructions.

Abstract

Recent LLM-based CAD generation methods represent models as command sequences, but they struggle to reference geometric entities (e.g., faces or edges), limiting complex edits such as chamfer and fillet and introducing quantization-induced topological errors.

We propose Pointer-CAD, a novel LLM-based framework that introduces a pointer-based command representation to explicitly model geometric relationships in B-rep CAD models. By conditioning each operation on both the textual description and the intermediate geometry, Pointer-CAD enables precise entity selection through pointer prediction, improving editing accuracy while reducing quantization errors.

Contributions

Pointer-based CAD Representation

A pointer-based command sequence representation enabling edge and face selection. This makes advanced operations like chamfer and fillet feasible for autoregressive methods and reduces quantization errors;

Pointer-CAD Framework

An LLM-based text-to-CAD framework built on the proposed representation. It employs a multi-step generation strategy, where each step is conditioned on both the textual description and the B-rep generated from previous ste

Generation Comparison

Models with Basic Operations (Sketch & Extrude)

Models with Complex Operations (Fillet & Chamfer)

Generation Showcase

Complex CAD models generated by Pointer-CAD

How Pointer-CAD Works

Pointer-CAD generates precise CAD models in a step-by-step procedural manner, where each generation step predicts only one fundamental CAD operation (Sketch, Extrude, Chamfer, or Fillet).

Pointer-CAD overview

Why step-wise generation?

Unlike previous command-sequence approaches that generate the entire modeling sequence at once, Pointer-CAD decomposes the process into multiple structured steps.
Each step focuses on predicting a single CAD operation, making the generation process more stable, interpretable, and suitable for complex designs.

How Pointer-CAD works

At every generation step, Pointer-CAD relies on three key components:

  • 🔗 Multimodal Fusion Module
    Takes the full text prompt together with the current B-rep geometry and fuses these modalities into a unified representation.

  • 🧠 Large Language Model (LLM)
    Predicts the command sequence for the current CAD operation based on the fused representation.

  • 📐 Vector Translation Module
    Converts the predicted command sequence into an updated B-rep, which serves as the input for the next step.

Key idea

By conditioning each step on both the text instruction and the intermediate geometry, Pointer-CAD enables accurate geometric reasoning and supports complex CAD operations that are difficult for previous autoregressive approaches.

Citation

If you find our work useful, please cite:

@article{qi2026pointer,
  title={Pointer-CAD: Unifying B-Rep and Command Sequences via Pointer-based Edges \& Faces Selection},
  author={Qi, Dacheng and Wang, Chenyu and Xu, Jingwei and Chu, Tianzhe and Zhao, Zibo and Liu, Wen and Ding, Wenrui and Ma, Yi and Gao, Shenghua},
  journal={arXiv preprint arXiv:2603.04337},
  year={2026}
}