nm000229 NEMAR-native dataset

Gwilliams et al. 2023 — Introducing MEG-MASC: a high-quality magneto-encephalography dataset for evaluating natural speech processing

MEG-MASC is a high-quality magnetoencephalography dataset comprising raw MEG recordings from 27 English speakers listening to approximately two hours of naturalistic stories from the Manually Annotated Sub-Corpus (MASC). The dataset includes precise temporal annotations of word and phoneme onsets/offsets, organized according to the Brain Imaging Data Structure (BIDS) standard. This benchmark dataset enables large-scale encoding and decoding analyses of neural responses to natural speech processing, with accompanying code for validation analyses including temporal decoding of phonetic features and word frequency effects.

AI-generated description, may include mistakes
ANAT MEG

Compute on this dataset

Two routes today, with a third (in-browser one-click submission) landing soon.

  1. NeuroScience Gateway (NSG) portal.

    NSG runs EEGLAB / Brainstorm / MNE pipelines on supercomputing time donated by SDSC. Create an account, point a job at this dataset's S3 prefix (s3://nemar/nm000229), and submit.
    nsgportal.org →

  2. Local processing with nemar-cli.

    Pull the dataset to your machine and run any toolbox locally. Honors the published version pinning.

    npm install -g nemar-cli
    nemar dataset clone nm000229
    cd nm000229 && nemar dataset get
  3. Just the files.

    rclone, aria2c, or any HTTPS client works against data.nemar.org/nm000229/ — the manifest carries presigned S3 URLs.

Direct compute access is coming soon. One-click NSG submission from this page is scoped for a follow-up phase. Tracked on nemarOrg/website#6.

Loading demographics…

Files

Loading file index…