on004952
NEMAR copy of ds004952

ChineseEEG: A Chinese Linguistic Corpora EEG Dataset for Semantic Alignment and Neural Decoding

ChineseEEG is a high-density EEG dataset with simultaneous eye-tracking recordings from 10 participants silently reading Chinese novels for approximately 11 hours each. The dataset includes raw and preprocessed EEG data at multiple filtering levels, eye-tracking data, Chinese text materials from two novels (The Little Prince and Garnett Dream), and BERT-based Chinese text embeddings. This resource enables investigation of semantic alignment between neural representations and natural language processing model embeddings during Chinese language comprehension.

AI-generated description, may include mistakes
Issues GitHub OpenNeuro ds004952

Download this dataset

dataset 696.7 GB exceeds 100.0 GB archive limit; use direct download. Use one of the streaming methods below — all resumable. Full download guide →

  1. NEMAR CLI recommended

    Pulls the pinned version + annexed data and resumes cleanly. Install nemar-cli →

    nemar dataset download on004952
  2. DataLad

    Clone the dataset repo and fetch file content on demand. Docs →

    datalad clone https://github.com/nemarDatasets/on004952 on004952
    cd on004952 && datalad get .
  3. git-annex

    Plain git + git-annex against the dataset repo. Docs →

    git clone https://github.com/nemarDatasets/on004952 on004952
    cd on004952 && git annex get .
  4. Direct files (wget / curl / rclone)

    Every file with a stable, range-resumable URL from the manifest. Needs curl, jq, wget (or rclone/aria2c). Docs →

    curl -s https://data.nemar.org/on004952/v1.0.0/manifest.json | jq -r '.[].bytes_url' > urls.txt
    wget -xc -i urls.txt

Compute on this dataset

Two routes today, with a third (in-browser one-click submission) landing soon.

  1. NeuroScience Gateway (NSG) portal.

    NSG runs EEGLAB / Brainstorm / MNE pipelines on supercomputing time donated by SDSC. Create an account, point a job at this dataset's S3 prefix (s3://nemar/on004952), and submit.
    nsgportal.org →

  2. Local processing with nemar-cli.

    Pull the dataset to your machine and run any toolbox locally. Honors the published version pinning.

    npm install -g nemar-cli
    nemar dataset clone on004952
    cd on004952 && nemar dataset get
  3. Just the files.

    rclone, aria2c, or any HTTPS client works against data.nemar.org/on004952/ — the manifest carries presigned S3 URLs.

Direct compute access is coming soon. One-click NSG submission from this page is scoped for a follow-up phase. Tracked on nemarOrg/website#6.

Citations

    Loading demographics…

    Files

    Loading file index…