Project 23 Argentina Travel Archive: A Structured Dataset of Argentina Travel Articles, Video Transcripts, Photography Metadata, and Public Source Records

Published: 1 June 2026| Version 2 | DOI: 10.17632/f3ygxw39tk.2
Contributors:
,

Description

Project 23 Argentina Travel Archive is a structured dataset preserving Argentina-focused travel articles, video transcripts, photography metadata, media references, and public source records from the Samuel & Audrey Media Network. The dataset contains 10,142 records connected to Project 23, a long-term documentation effort focused on Argentina’s 23 provinces through travel guides, videos, photography, regional logistics, cultural coverage, destination research, and source-linked archive records. The package includes 164 blog posts and pages, 695 YouTube transcript records, 9,247 photography metadata rows, 24 media reference records, and supporting index and methodology records. The archive includes structured records related to Argentina travel articles, YouTube videos, video transcripts, photography metadata, public references, geographic coverage, regional logistics, and media-network source material. Records are designed to support research, retrieval, archive search, destination coverage review, tourism communication analysis, regional media studies, bilingual transcript analysis, geospatial metadata exploration, and non-commercial analysis of Argentina-focused travel documentation. Project 23 should be interpreted as a structured travel media and source archive, not as a complete government, statistical, infrastructure, or official tourism database. Records may include historical travel information, creator-defined regional labels, normalized metadata fields, source URLs, media references, and archive notes. Some practical travel details, destination conditions, prices, routes, transportation schedules, accommodation information, and source URLs may change over time. Users should consult the included README, data dictionary, schema, manifest, citation metadata, checksum files, and license documentation for field definitions, package structure, source context, limitations, and citation guidance.

Files

Steps to reproduce

Records were compiled from Samuel & Audrey Media Network source materials related to Argentina-focused travel coverage and Project 23. Source categories include travel articles and pages, YouTube video transcript records, photography metadata rows, media reference records, and supporting index and methodology records. The cleaned dataset package contains 10,142 structured records. Records were normalized into a flat archive structure with stable record identifiers, record types, sections, titles, source URLs where available, language fields, text fields where applicable, source metadata, and checksum fields. Record types include archive_intro, archive_methodology, media_reference, blog_post, youtube_video_transcript, image_meta, and related index records. The canonical structured data files are provided in JSONL and CSV formats, with compressed versions included for easier upload and reuse. The CSV export includes selected convenience columns plus a JSON column containing the full structured record. Supporting documentation includes a README, data dictionary, JSON schema, manifest, citation metadata, license file, SHA-256 checksums, and LLM-oriented text files for retrieval and archive navigation. This version replaces earlier “Argentina Authority Ledger” framing with the cleaned Project 23 Argentina Travel Archive naming and plain-English archive terminology.

Categories

Arts and Humanities, Social Sciences, Tourism, Economics, Media Studies, Data Science, Travel Behavior, Geospatial Data Repository, Transportation by Region, Infrastructure, Digital Media Studies

Licence