Skip to content

chore(ci): use tpchgen-cli for generating the tpch dataset#1384

Closed
mesejo wants to merge 1 commit intoapache:mainfrom
mesejo:chore/use_tpchgen-cli
Closed

chore(ci): use tpchgen-cli for generating the tpch dataset#1384
mesejo wants to merge 1 commit intoapache:mainfrom
mesejo:chore/use_tpchgen-cli

Conversation

@mesejo
Copy link
Copy Markdown
Contributor

@mesejo mesejo commented Feb 18, 2026

Which issue does this PR close?

Closes #1120.

Are there any user-facing changes?

N/A

@timsaucer
Copy link
Copy Markdown
Member

Thanks for taking this on! I think the new tool has an option to output parquet files directly, so we might be able to skip the entire data conversion script.

@kevinjqliu
Copy link
Copy Markdown
Contributor

yea we can probably replace all usage of tpch-gen.sh
https://github.com/search?q=repo%3Aapache%2Fdatafusion-python%20tpch-gen.sh&type=code

also with uv, you can just do

uvx tpchgen-cli --help

uvx installs the library automatically

@timsaucer
Copy link
Copy Markdown
Member

Superceded as part of #1443

@timsaucer timsaucer closed this Mar 26, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Update CI to use rust based tpc-h data generator

3 participants