nm000251 NEMAR-native dataset

He et al. 2025 — VocalMind: A Stereotactic EEG Dataset for Vocalized, Mimed, and Imagined Speech in Tonal Language

VocalMind is a stereotactic EEG (sEEG) dataset comprising intracranial recordings from one participant performing vocalized, mimed, and imagined speech tasks in Mandarin Chinese, a tonal language. The dataset supports research on speech decoding, brain-computer interfaces, and neural mechanisms of overt and covert speech production. Raw recordings sampled at 1000 Hz were converted to BIDS iEEG format with event markers indicating stimulus onsets for each speech modality.

IEEG

Issues GitHub

Download this dataset

Pick a method. Large datasets skip the zip and use the streaming methods below — all resumable. Full download guide →

Download archive (.zip) — 1.2 GB
A single zip of the published version. Best for small/medium datasets.

Download zip
NEMAR CLI recommended
Pulls the pinned version + annexed data and resumes cleanly. Install nemar-cli →
```
nemar dataset download nm000251
```

DataLad

Clone the dataset repo and fetch file content on demand. Docs →

datalad clone https://github.com/nemarDatasets/nm000251 nm000251
cd nm000251 && datalad get .

git-annex

Plain git + git-annex against the dataset repo. Docs →

git clone https://github.com/nemarDatasets/nm000251 nm000251
cd nm000251 && git annex get .

Direct files (wget / curl / rclone)
Every file with a stable, range-resumable URL from the manifest. Needs curl, jq, wget (or rclone/aria2c). Docs →
```
curl -s https://data.nemar.org/nm000251/v1.0.0/manifest.json | jq -r '.[].bytes_url' > urls.txt
wget -xc -i urls.txt
```

Compute on this dataset

Two routes today, with a third (in-browser one-click submission) landing soon.

NeuroScience Gateway (NSG) portal.
NSG runs EEGLAB / Brainstorm / MNE pipelines on supercomputing time donated by SDSC. Create an account, point a job at this dataset's S3 prefix (s3://nemar/nm000251), and submit.
nsgportal.org →
Local processing with nemar-cli.
Pull the dataset to your machine and run any toolbox locally. Honors the published version pinning.
```
npm install -g nemar-cli
nemar dataset clone nm000251
cd nm000251 && nemar dataset get
```
Just the files.
rclone, aria2c, or any HTTPS client works against data.nemar.org/nm000251/ — the manifest carries presigned S3 URLs.

Direct compute access is coming soon. One-click NSG submission from this page is scoped for a follow-up phase. Tracked on nemarOrg/website#6.

Loading demographics…

Files

Loading file index…

Cite this dataset

DOI 10.82901/nemar.nm000251

He, T., Wei, M., Wang, R., Wang, R., Du, S., Cai, S., Tao, W., & Li, H. (2026). He et al. 2025 — VocalMind: A Stereotactic EEG Dataset for Vocalized, Mimed, and Imagined Speech in Tonal Language (Version v1.0.0) [Data set]. NEMAR. https://doi.org/10.82901/nemar.nm000251

License

CC BY 4.0

Modalities

IEEG

Tasks

imagined mimed vocalized

BIDS datatypes

ieeg

Published

Jun 17, 2026 2 days ago

Authors

Tianyu He
Mingyi Wei
Ruicong Wang
Renzhi Wang
Shiwei Du
Siqi Cai

+ 2 more

Wei Tao
Haizhou Li

Funding

Keywords

stereotactic EEG intracranial EEG speech decoding brain-computer interfaces imagined speech tonal language Mandarin Chinese

Related identifiers

IsDerivedFrom 10.5281/zenodo.14696348
IsDerivedFrom 10.1038/s41597-025-04741-2
IsDescribedBy github.com/nemarDatasets/nm000251…
IsDescribedBy nemar.org/dataexplorer/detail?dataset_id=nm000251…