Upload datasets
How to upload datasets to the PortexAI Datalab
Need access to seller privileges? Submit a request in the data studio and we'll get back to you.
Adding a new dataset
Users on the PortexAI Datalab with data seller permissions can upload datasets through the data studio.
To start, pick the format that matches your data:
Tabular data → upload a .parquet file for faster queries and built-in compression.
Multi-modal data (for example, images + text + labels) → bundle everything into a single .tar.gz or .gz archive to keep files together.
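As a rough illustration of the bundling step for multi-modal data, the sketch below packs a local folder into a single .tar.gz archive using Python's standard tarfile module. The function name bundle_dataset and the folder layout are hypothetical, chosen only for this example; nothing here is a PortexAI API, and the upload itself still happens through the data studio.

```python
import tarfile
from pathlib import Path


def bundle_dataset(source_dir: str, archive_path: str) -> list[str]:
    """Pack every file under source_dir into one gzip-compressed tar archive.

    Returns the archive member names so the bundle can be checked before upload.
    (Hypothetical helper for illustration; not part of any PortexAI tooling.)
    """
    source = Path(source_dir)
    if not source.is_dir():
        raise NotADirectoryError(source_dir)

    target = Path(archive_path).resolve()
    with tarfile.open(archive_path, "w:gz") as archive:
        # Sorting gives a stable member order across runs.
        for path in sorted(source.rglob("*")):
            # Skip directories, and skip the archive itself if it lives in source_dir.
            if path.is_file() and path.resolve() != target:
                # Store paths relative to the dataset root, not the local disk layout.
                archive.add(path, arcname=path.relative_to(source).as_posix())

    # Re-open the finished archive and list what it actually contains.
    with tarfile.open(archive_path, "r:gz") as archive:
        return archive.getnames()
```

Calling bundle_dataset("my_dataset", "my_dataset.tar.gz") on a folder that holds images, text, and labels should produce one archive ready to upload, and the returned list is a quick way to confirm nothing was left out.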
For the Init Commit message, add something like “initial upload”; every future update gets its own commit, giving you a transparent change log buyers can trust. Note that upload speeds will vary based on your local connection.
By default, you can host multiple datasets up to a combined 20 GB. If you're working with something larger, reach out to us and we can work with you to provision more storage for your account. To list a “collection” (several related datasets in one listing), bundle them into a single .tar.gz archive first.
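Before uploading, it can help to check locally whether your files fit within the default allowance. The sketch below sums file sizes and compares the total with 20 GB. It assumes 20 GB means 20 × 10^9 bytes, which may not match how the platform counts storage, and the helper names combined_size and fits_quota are hypothetical. It also only sees the files you pass in, not datasets already hosted on your account.

```python
from pathlib import Path

# Assumption: "20 GB" is read as 20 * 10**9 bytes; the platform may count differently.
DEFAULT_QUOTA_BYTES = 20 * 10**9


def combined_size(paths: list[str]) -> int:
    """Return the total size in bytes of the given local files."""
    return sum(Path(p).stat().st_size for p in paths)


def fits_quota(paths: list[str], quota_bytes: int = DEFAULT_QUOTA_BYTES) -> bool:
    """Return True if the files together stay within quota_bytes."""
    return combined_size(paths) <= quota_bytes
```

For example, fits_quota(["sales.parquet", "images.tar.gz"]) returns False when the two files together exceed the assumed limit, which is a signal to request more storage before starting the upload.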

Removing a dataset
To remove a dataset, enter the data studio and navigate to the datasets tab. From here, you can select an existing dataset and delete it. Because deletion cannot be undone, you will be asked to enter your password to confirm.
