Import Your Dataset

Dataverse streamlines your data management processes, making it easier to organize and visualize your data.

Dataverse accepts various data formats, and you can choose opendataset as your data source.

Quick Start

Starting with a bunch of images or videos? Try folder upload. And if you're starting from scratch, pop in prepared open datasets and get rolling.

Click on the video below for a quick overview:

Dataset List

On the Dataset page, you can view a list of all uploaded datasets and their status.

Click "Detail" to view the details of this dataset, including whether the upload was successful or if an error message is displayed.

Hint

The system supports resuming from breakpoints.

Clicking "View in Data Visualization" will display all files in this folder.

Import Dataset

The Linker platform supports several methods, allowing you to choose your preference for uploading data.

Click on "Import," select the data format you want to upload and enter the data upload settings.

  • Upload from Local: upload directly from local folder.

  • Raw Data: data without annotations

  • Annotated Data: data with annotations

  • Existing: import dataset from other project.

  • Open Dataset: import preselceted dataset.

  • SDK: Use SDK to upload from the cloud or locally.

Good to know: Starting with a bunch of images or videos? Try folder upload. And if you're starting from scratch, pop in prepared open datasets and get rolling.

For different options, different data content may be required to assist in data uploading.

On the Dataset page, click "Import," choose the target based on the uploaded data content, and enter the data upload settings.


Data Upload from Local

Browse and select images or videos directly from your local folder for easy importation into the platform.

  1. Image upload

  2. Video upload

Only single-camera projects are suitable for fast folder upload.


Annotated Data Upload Format

The platform provides the following data upload formats:

  1. VisionAI data format - object detection

After filling in the upload format, indicate whether the data is sequence data or whether it is uploaded only for the results of specific sensors.

After clicking "Import," the system will save the file as a database according to the settings, making it easier to filter and search in the future.

Uploading through AWS

Connect to your AWS S3 bucket, and the file will be transferred to the platform.。

  • Bucket URL: AWS S3 bucket link (Bucket URL)

  • Data Folder: Location of the file folder that needs to be imported.

  • Access Key ID & Secret Access Key (Optional): Use your Access Key ID and Secret Access Key for secure AWS S3 operations. These keys authenticate and authorize actions like retrieving or listing objects. Remember, specific permissions (e.g.,GetObject,ListObjects,ListObjectsV2,CopyObject) are required for this operations.

Amazon S3 virtual-hosted–style URLs use the following format:

https://bucket-name.s3.region-code.amazonaws.com/

Hint: The Access Key ID and Secret Access Key fields are optional for public AWS S3 buckets. If your S3 bucket is set to public, you can leave these fields blank.

Uploading through Azure

Connect to your Azure blob storage, and the file will be transferred to the platform.

  • Blob Storage URL: Azure storage link (Blob storage account URL)

  • Container Name: Blob storage container name

  • Data Folder: Location of the data folder stored in the container that needs to be imported.

  • SAS Token: SAS token that provides read permissions to the Azure blob storage account.

Please provide the SAS token for your Azure Blob Storage account, not the container, when uploading.


Import Open Dataset

Import pre-selected open datasets directly into your workspace for efficient model training and data analysis.


Adding dataset using Python SDK.

Use Dataverse-SDK for Python to help you to interact with the Dataverse platform by Python. Currently, the library supports:

Install Package

git clone https://github.com/linkervision/dataverse-sdk.git
pip install dataverse-sdk

Quick Tools

Import Dataset

python tools/import_dataset_from_local.py -host https://visionai.linkervision.ai/dataverse/curation -e {your-account-email} -p {PASSWORD} -s {service-id}  -project {project-id} --folder {/YOUR/TARGET/LOCAL/FOLDER} -name {dataset-name} -type {raw_data OR annotated_data} -anno {image OR vision_ai} --sequential

Export Dataslice and download files

python tools/export_dataslice.py -host https://visionai.linkervision.ai/dataverse/curation  -e {your-account-email} -p {PASSWORD} -s {service-id} -dataslice {dataslice_id} -f {/YOUR/TARGET/LOCAL/file.zip}

To use the SDK, you need to provide service-id.

When you select SDK in Import Dataset, your Service ID will be displayed in the content, which can be copied and used.


Import from Existing Data

The system offers the option to import a dataset from an existing dataset of another project. At this point, an ontology mapping is required to ensure data compatibility.


Advanced Image Processing

Point Cloud File 2D Preview Image

When generating 2D preview images, the system can batch-generate 2D preview images for point cloud data, which helps with later visualization browsing.

IQA for Image Analysis Results

When the automatic image quality assessment generation function is turned on, the system will batch-generate data for each image, including brightness, contrast, and other information. For details, please refer to the documentation.

Image Quality Assessment (IQA)

Auto-Tagging

Dataverse offers an advanced auto-tagging feature to simplify data management, automatically generating image tags (weather, scene, time of day) for users.

Auto-Tagging

Last updated