stimmenfryslan/Readme.md

6.3 KiB
Raw Permalink Blame History

Stimmen Fryslan

Reproducibiliy results [paper xyz]

These notebooks allow for the reroducabiluty, they require access to the stimmen mysql database. One needs to request to this database.

General statistics

Statistics for Nanna's email of 2019-02-13

Calculates statistics of the stimmen app usage.

Regions

Partition provinces in wijken and gemeentes

Partitions Fryslan, the Dutch province, with repesct to two granularities, as defined by the CBS 'wijken' and 'gemeentes' of 2017. These partitionings are used in all maps created with the other notebooks.

Heatmaps

[Frysian pronunciation occurrence](notebooks/Frysian pronunciation occurrence.ipynb)

Creates all heatmaps illustrating the distribution of one pronunciation relative to all other pronunciations of that word.

Result example:

Several pronunciation heatmaps maps for the word zaterdag, respectively across both rows for gemeentes and wijken granularities.

snjoun sa:tədex sɑ:təɾjə snɪwn
example pronunciation occurence map example pronunciation occurence map example pronunciation occurence map example pronunciation occurence map
example pronunciation occurence map example pronunciation occurence map example pronunciation occurence map example pronunciation occurence map

Distribution maps

Creates maps for both granularities, each illustrating the pronunciation distribution of one word.

[Frysian pronunciation distribution maps](notebooks/Frysian pronunciation distribution maps.ipynb)

Result example:

Several pronunciation distribution maps for different words, respectively across both rows for gemeentes and wijken granularities.

zaterdag vis geel oog
example pronunciation distribution map example pronunciation distribution map example pronunciation distribution map example pronunciation distribution map
example pronunciation distribution map example pronunciation distribution map example pronunciation distribution map example pronunciation distribution map

Notebooks

Extract Frysian dialect regions

notebook

Get polygons of dialect regions as mapped in this image

dialect regions

using image processing.

Results

Group recordings to Frysian dialect regions

notebook

Create spreadsheets with the recordings assigned to dialect regions.

Results

Segment Friesland (and Groningen) in Gemeentes and Wijken

notebook

Some of the wijken are merges, for example part of Leeuwarden, to avoid that the segementation gets too fine grained.

notebook

Visualized maps of the segmentations.

Results:

  • data/Friesland_gemeentes.geojson
  • data/Friesland_gemeentes.kml
  • data/Friesland_wijken.geojson
  • data/Friesland_wijken.kml
  • data/Groningen_gemeentes.geojson
  • data/Groningen_gemeentes.kml
  • data/Groningen_wijken.geojson
  • data/Groningen_wijken.kml

Posterior probabilities and Likelyhoods for origin based on word pronunciation

notebook

Tables with the posterior probabilities and likelihoods of being from a region based on the stated pronunciation of one specific word.

Gabmap tab seperated files

notebook

Create tab separated files to be used by gapmap, based on the geojson regions as created by

Segment Provinces in Wijken and Gemeentes.ipynb

This is a simple example for the created gabmap files.

notebook

Bar Maps per word for Pronunciation Occurrence in Frysian Municipalities

For each word, a map illustrates the pronunciation occurrence as measured by the prediction quiz, per Frysian municipality.

notebook

Heatmap per word for Pronunciation Occurrence in Frysian Municipalities

notebook

Each map displays the pronounciation occurence in Frysian municipalities for one word. Each pronunciation is represented by one map layer, and for one municipality layer the percentages for each pronunciation add up to 100% + rounding errors.

Heatmap per word for Pronunciation Occurrence in Frysian Neighborhoods

Same as for Municipalities, but for Neighborhoods.

notebook