Import Your Dataset
Dataverse streamlines your data management processes, making it easier to organize and visualize your data.
Dataverse accepts various data formats, and you can choose opendataset as your data source.
Quick Start
Starting with a bunch of images or videos? Try folder upload. And if you're starting from scratch, pop in prepared open datasets and get rolling.
Click on the video below for a quick overview:
Dataset List
On the Dataset page, you can view a list of all uploaded datasets and their status.

Click "Detail" to view the details of this dataset, including whether the upload was successful or if an error message is displayed.

Clicking "View in Data Visualization" will display all files in this folder.

Import Dataset
The Linker platform supports several methods, allowing you to choose your preference for uploading data.
Click on "Import," select the data format you want to upload and enter the data upload settings.
Upload from Local: upload directly from local folder.
Raw Data: data without annotations
Annotated Data: data with annotations
Existing: import dataset from other project.
Open Dataset: import preselceted dataset.
SDK: Use SDK to upload from the cloud or locally.

On the Dataset page, click "Import," choose the target based on the uploaded data content, and enter the data upload settings.
Data Upload from Local
Browse and select images or videos directly from your local folder for easy importation into the platform.
Image upload
Video upload

Annotated Data Upload Format
The platform provides the following data upload formats:
VisionAI data format - object detection
VLM data format - VQA
Coco format: https://cocodataset.org/#format-data
After filling in the upload format, indicate whether the data is sequence data or whether it is uploaded only for the results of specific sensors.
After clicking "Import," the system will save the file as a database according to the settings, making it easier to filter and search in the future.
Uploading through AWS
Connect to your AWS S3 bucket, and the file will be transferred to the platform.。
Bucket URL: AWS S3 bucket link (Bucket URL)
Data Folder: Location of the file folder that needs to be imported.
Access Key ID & Secret Access Key (Optional): Use your Access Key ID and Secret Access Key for secure AWS S3 operations. These keys authenticate and authorize actions like retrieving or listing objects. Remember, specific permissions (e.g.,GetObject,ListObjects,ListObjectsV2,CopyObject) are required for this operations.

Uploading through Azure
Connect to your Azure blob storage, and the file will be transferred to the platform.
Blob Storage URL: Azure storage link (Blob storage account URL)
Container Name: Blob storage container name
Data Folder: Location of the data folder stored in the container that needs to be imported.
SAS Token: SAS token that provides read permissions to the Azure blob storage account.

Import Open Dataset
Import pre-selected open datasets directly into your workspace for efficient model training and data analysis.
Coco dataset: https://cocodataset.org/#home
KITTI dataset:https://www.cvlibs.net/datasets/kitti-360/index.php
LVIS dataset:https://www.lvisdataset.org/
BDD dataset: https://bdd-data.berkeley.edu/

Adding dataset using Python SDK.
Use Dataverse-SDK for Python to help you to interact with the Dataverse platform by Python. Currently, the library supports:
Install Package
git clone https://github.com/linkervision/dataverse-sdk.git
pip install dataverse-sdk
Quick Tools
Import Dataset
python tools/import_dataset_from_local.py -host https://visionai.linkervision.ai/dataverse/curation -e {your-account-email} -p {PASSWORD} -s {service-id} -project {project-id} --folder {/YOUR/TARGET/LOCAL/FOLDER} -name {dataset-name} -type {raw_data OR annotated_data} -anno {image OR vision_ai} --sequential
Export Dataslice and download files
python tools/export_dataslice.py -host https://visionai.linkervision.ai/dataverse/curation -e {your-account-email} -p {PASSWORD} -s {service-id} -dataslice {dataslice_id} -f {/YOUR/TARGET/LOCAL/file.zip}
Firstly, prepare the target project and its ontology. + Create Project
Import from Existing Data
The system offers the option to import a dataset from an existing dataset of another project. At this point, an ontology mapping is required to ensure data compatibility.

Advanced Image Processing
Point Cloud File 2D Preview Image
When generating 2D preview images, the system can batch-generate 2D preview images for point cloud data, which helps with later visualization browsing.

IQA for Image Analysis Results
When the automatic image quality assessment generation function is turned on, the system will batch-generate data for each image, including brightness, contrast, and other information. For details, please refer to the documentation.
Image Quality Assessment (IQA)Auto-Tagging
Dataverse offers an advanced auto-tagging feature to simplify data management, automatically generating image tags (weather, scene, time of day) for users.
Auto-TaggingLast updated