Seed lists for BelgicaWeb research project related to the archiving of web and social media content (doi:10.34934/DVN/C0DGSS)

Document Description

Citation

Title:

Seed lists for BelgicaWeb research project related to the archiving of web and social media content

Identification Number:

doi:10.34934/DVN/C0DGSS

Distributor:

Social Sciences and Digital Humanities Archive – SODHA

Date of Distribution:

2025-08-26

Version:

1

Bibliographic Citation:

Geeraert, Friedel; Vandendyck, Christina, 2025, "Seed lists for BelgicaWeb research project related to the archiving of web and social media content", https://doi.org/10.34934/DVN/C0DGSS, Social Sciences and Digital Humanities Archive – SODHA, V1, UNF:6:V1hHFXDQD/rmLVuNusTOTg== [fileUNF]

Study Description

Citation

Title:

Seed lists for BelgicaWeb research project related to the archiving of web and social media content

Identification Number:

doi:10.34934/DVN/C0DGSS

Authoring Entity:

Geeraert, Friedel (KBR)

Vandendyck, Christina (KBR)

Distributor:

Social Sciences and Digital Humanities Archive – SODHA

Access Authority:

Geeraert, Friedel

Access Authority:

Vandendyck, Christina

Depositor:

Geeraert, Friedel

Date of Deposit:

2025-08-26

Holdings Information:

https://doi.org/10.34934/DVN/C0DGSS

Study Scope

Keywords:

Arts and Humanities, Computer and Information Science

Abstract:

These spreadsheets are part of the research data produced within the BelgicaWeb project, a BRAIN 2.0 project funded by BELSPO (2024-2026). BelgicaWeb aims to make Belgium’s born-digital heritage accessible and FAIR (Findable, Accessible, Interoperable and Reusable) by developing a user-friendly access platform and an API that enables access at data level. The spreadsheets list the seeds that were given to the Browsertrix crawler software to create a corpus of archived web and social media content.

Methodology and Processing

Sources Statement

Data Access

Notes:

This work is licensed under a Creative Commons Attribution 4.0 International License (CC-BY): http://creativecommons.org/licenses/by/4.0/

Other Study Description Materials

File Description--f5084

File: 20250825_full_list_PROMISE.tab

  • Number of cases: 3137

  • No. of variables per record: 1

  • Type of File: text/tab-separated-values

Notes:

UNF:6:aZGbFBi6llFhObvC7/O2CA==

File Description--f5083

File: 20250826_full_list_BESOCIAL.tab

  • Number of cases: 1507

  • No. of variables per record: 1

  • Type of File: text/tab-separated-values

Notes:

UNF:6:9BmPTzS+0mtyHBdIWLBRBQ==

Variable Description

List of Variables:

Variables

http://www.parts.be/

f5084 Location:

Variable Format: character

Notes: UNF:6:aZGbFBi6llFhObvC7/O2CA==

https://www.kbr.be/en/projects/besocial/

f5083 Location:

Variable Format: character

Notes: UNF:6:9BmPTzS+0mtyHBdIWLBRBQ==
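
Example: reading a seed list

Both files hold a single character variable, and the variable names listed above are themselves URLs. This suggests, as an assumption not confirmed by the codebook, that the files have no header row and that the first seed was ingested as the column name. The Python sketch below reads such a one-column tab-separated file under that assumption. The function name read_seeds and the parameter first_row_is_seed are hypothetical and introduced only for illustration.

```python
import csv
from typing import Iterable, List


def read_seeds(lines: Iterable[str], first_row_is_seed: bool = True) -> List[str]:
    """Return the non-empty values of the first column of a tab-separated seed list."""
    reader = csv.reader(lines, delimiter="\t")
    # Keep only rows that have a non-blank first column.
    seeds = [row[0].strip() for row in reader if row and row[0].strip()]
    # Assumption: the first row is itself a seed URL rather than a real header.
    # Pass first_row_is_seed=False to drop it if that turns out to be wrong.
    return seeds if first_row_is_seed else seeds[1:]
```

The function accepts any iterable of text lines, so it can be given a file object opened on a local copy of 20250825_full_list_PROMISE.tab or 20250826_full_list_BESOCIAL.tab (opened with newline="" and encoding="utf-8"). Comparing the length of the result with the number of cases reported for each file is a quick check on whether the first-row assumption holds.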