nm000261 NEMAR-native dataset

Imagined speech EEG dataset — vowels condition (Nguyen et al. 2017)

This dataset comprises preprocessed EEG recordings from 8 healthy subjects performing imagined speech tasks involving three vowel phonemes (a, i, u). Participants received auditory and visual cues to imagine speaking each vowel, with 64-channel EEG data acquired at 256 Hz. The dataset includes 2,400 base trials (8 subjects × 300 trials per subject). Three overlapping 2-second epochs were extracted per trial, yielding 7,200 epochs in total. In the original study (Nguyen et al. 2017), the data were analyzed using Riemannian manifold-based feature extraction and relevance vector machine classification for brain-computer interface applications. The dataset is part of the MOABB (Mother of All BCI Benchmarks) collection and has been converted to BIDS format.
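The trial and epoch counts in the description follow from simple arithmetic, and the overlapping windows can be sketched in a few lines of Python. This is a minimal illustration, not the authors' extraction code. The 1-second step between epoch starts (`ASSUMED_STEP_SEC`) is a hypothetical value, since the description states only that the three epochs overlap, and `epoch_bounds` is an illustrative helper name.

```python
# Values stated in the dataset description.
SFREQ_HZ = 256            # sampling rate
EPOCH_SEC = 2             # epoch length in seconds
N_SUBJECTS = 8
TRIALS_PER_SUBJECT = 300
EPOCHS_PER_TRIAL = 3

# ASSUMPTION: the offset between consecutive epochs is not given on this page.
ASSUMED_STEP_SEC = 1.0


def epoch_bounds(step_sec=ASSUMED_STEP_SEC):
    """Return (start, stop) sample indices of each epoch within one trial."""
    length = int(EPOCH_SEC * SFREQ_HZ)
    step = int(step_sec * SFREQ_HZ)
    return [(k * step, k * step + length) for k in range(EPOCHS_PER_TRIAL)]


base_trials = N_SUBJECTS * TRIALS_PER_SUBJECT      # 8 x 300
total_epochs = base_trials * EPOCHS_PER_TRIAL      # 3 epochs per trial
samples_per_epoch = EPOCH_SEC * SFREQ_HZ           # 2 s at 256 Hz

print(base_trials, total_epochs, samples_per_epoch)
print(epoch_bounds())
```

With 64 channels, each epoch would then be a 64 × 512 array. Any step shorter than the 2-second epoch length produces overlapping windows.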

EEG


Download this dataset

Pick a method. For large datasets, skip the zip and use one of the streaming methods below, all of which are resumable. Full download guide →

Download archive (.zip) — 0.4 GB
A single zip of the published version. Best for small/medium datasets.

NEMAR CLI (recommended)
Pulls the pinned version + annexed data and resumes cleanly. Install nemar-cli →
```
nemar dataset download nm000261
```

DataLad

Clone the dataset repo and fetch file content on demand. Docs →

```
datalad clone https://github.com/nemarDatasets/nm000261 nm000261
cd nm000261 && datalad get .
```

git-annex

Plain git + git-annex against the dataset repo. Docs →

```
git clone https://github.com/nemarDatasets/nm000261 nm000261
cd nm000261 && git annex get .
```

Direct files (wget / curl / rclone)
Every file with a stable, range-resumable URL from the manifest. Needs curl, jq, wget (or rclone/aria2c). Docs →
```
curl -s https://data.nemar.org/nm000261/v1.0.1/manifest.json | jq -r '.[].bytes_url' > urls.txt
wget -xc -i urls.txt
```
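For readers without jq, the manifest filtering step can be approximated in Python. This is a sketch under assumptions: it relies only on what the jq filter `.[].bytes_url` implies, namely a collection of entries that each carry a `bytes_url` field. The sample entries and `example.org` URLs are placeholders rather than real manifest content, and `extract_urls` is a hypothetical helper, not part of any NEMAR tool.

```python
import json


def extract_urls(manifest, key="bytes_url"):
    """Roughly mimic jq's `.[].bytes_url` on an already-parsed manifest.

    jq's `.[]` iterates over array elements or object values, so both
    shapes are accepted. Unlike jq, a missing key raises KeyError here
    instead of yielding null.
    """
    entries = manifest.values() if isinstance(manifest, dict) else manifest
    return [entry[key] for entry in entries]


# Placeholder manifest excerpt (ASSUMPTION: real entries carry more fields).
sample = json.loads("""
[
  {"bytes_url": "https://example.org/files/a.bin"},
  {"bytes_url": "https://example.org/files/b.bin"}
]
""")

urls = extract_urls(sample)
urls_text = "\n".join(urls) + "\n"   # one URL per line, like `jq -r`
print(urls_text, end="")
```

Writing `urls_text` to `urls.txt` gives a file that can be passed to `wget -xc -i urls.txt` as in the shell snippet.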

Compute on this dataset

Two routes today, with a third (in-browser one-click submission) landing soon.

NeuroScience Gateway (NSG) portal.
NSG runs EEGLAB / Brainstorm / MNE pipelines on supercomputing time donated by SDSC. Create an account, point a job at this dataset's S3 prefix (s3://nemar/nm000261), and submit.
nsgportal.org →
Local processing with nemar-cli.
Pull the dataset to your machine and run any toolbox locally. Honors the published version pinning.
```
npm install -g nemar-cli
nemar dataset clone nm000261
cd nm000261 && nemar dataset get
```
Just the files.
rclone, aria2c, or any HTTPS client works against data.nemar.org/nm000261/ — the manifest carries presigned S3 URLs.

Direct compute access is coming soon. One-click NSG submission from this page is scoped for a follow-up phase. Tracked on nemarOrg/website#6.

Cite this dataset

DOI 10.82901/nemar.nm000261

Nguyen, C. H., Karavas, G. K., & Artemiadis, P. (2026). Imagined speech EEG dataset — vowels condition (Nguyen et al. 2017) (Version v1.0.1) [Data set]. NEMAR. https://doi.org/10.82901/nemar.nm000261

License

other-open

Modalities

EEG

Tasks

imagery

BIDS datatypes

eeg

Sessions

Published

Jun 23, 2026

Authors

Chuong H. Nguyen
George K. Karavas
Panagiotis Artemiadis

Keywords

imagined speech, EEG, brain-computer interface, motor imagery, Riemannian manifold, covariance matrix, vowels, relevance vector machines

Related identifiers

IsDerivedFrom 10.1088/1741-2552/aa8235
References 10.21105/joss.01896
References 10.1038/s41597-019-0104-8
IsDescribedBy github.com/nemarDatasets/nm000261…
IsDescribedBy nemar.org/dataexplorer/detail?dataset_id=nm000261…