COVID-19 Frequently Asked Questions in 12 Languages, November 2021
Updated 4y ago
Available on 1 platform
Sign in to view source links and access this dataset
Description
A multilingual FAQ dataset from the Oregon government, last updated December 1, 2021. Columns suggest it contains pairs of questions and answers about COVID-19, translated into languages including Spanish, Vietnamese, Russian, Arabic, and Hmong. The data is provided in CSV, JSON, XML, and RDF formats.
Use Cases
Training a multilingual question-answering model using the 'Question' and 'Answer' columns in English and their translations (inferred from domain, verify after download).
Analyzing public health information needs by comparing 'FAQ ID' across different language versions of the same question (inferred from domain, verify after download).
Building a translation corpus for low-resource languages like Chuukese or Marshallese using the language-specific question and answer columns (inferred from domain, verify after download).
Strengths
Published on the Socrata platform by data.oregon.gov.
Covers at least 12 languages, as indicated by column names for questions and answers.
Limitations
Metadata is minimal; actual content requires verification after download.
Last updated 2021-12-01 22:32:47; freshness should be verified.
Row count is unknown, which may limit suitability assessment.
Provenance
Source
data.oregon.gov
Time Range
As of November 24, 2021
Freshness
2021-12-01
Geography
Likely Oregon, USA (inferred from source domain).
License is unknown; check terms of use before application.