Word Safari Application

Overview

The interface component of the project focuses on developing a solution for improving the accessibility of contributing lexicographical data to Wikidata for two Niger-Congo B languages - isiZulu and isiXhosa. It does this through easy-to-use gamified interfaces that are accessible to speakers of the two languages and these interfaces both collect direct translations and noun class classifications.

Problem

One of the problems relating to the under-representation of lexicographical data of Niger-Congo B languages on Wikidata is that the current Wikidata interface is aimed at expert users. This pool of users is small for Niger-Congo B Languages. Other popular Wikidata interfaces aimed at collecting data are for Indo-European languages and do not take the agglutinative structure of Niger-Congo B languages into account. This agglutinative structure refers to how the noun class changes the sentence structure which is important for the sentence generation aspect of the Abstract Wikipedia project and this data needs to be collected along with direct translations.

Objectives

See if gamified interfaces can be used to motivate users to contribute more lexicographical data.

Comparing two gamified interfaces

Testing how much more effective is a gamified interface with a leaderboard compared to one without a leaderboard at motivating users to contribute lexicographical data.

Comparing gamification to Wikidata

How effective are gamified interfaces at collecting lexicographical data compared to the current Wikidata interface for two Niger-Congo B languages (such as isiZulu and isiXhosa).

Interface Design and Development

Word Safari Phone Screens

A high-fidelity prototype was first designed and tested and then two interfaces were developed from the feedback of the prototype. Word Safari Explorers is the basic interface that just uses a points system. Word Safari Champions is the advanced interface with leaderboards, feedback screens, daily streaks, along with the points system (same as the Explorers edition). Both games collect direct translations and noun class classifications and were developed as Android applications in Android Studio with Firebase Realtime Database.

Experiment

The experiment ran study over 5 days with 10 users per a game. The top 5 players by score from Word Safari Champions won R200. A random selection of 5 users from Word Safari Explorers were given R200.

97

Original Amount of Wikidata Lexemes

1145

Lexemes Collected from Explorers Game

2103

Lexemes Collected from Champions Game

Results

Gamified interfaces are more effective than Wikidata interface for collecting lexicographical data and the more gamified interface collected more words (collected 1145 lexemes for Explorers and 2103 lexemes for Champions - original amount of lexemes on Wikidata was 97).

An important finding was that monetary motivation was the main motivator for users, and not gamification itself and this is key in low-resource environments such as South Africa. Users also sometimes struggled to input the correct noun classes and this is an area for possible future work.

Documents

Literature
Review

Review of the academic literature relevant to gamified interfaces for collecting lexicographical data.


Open 

Project
Proposal

Assess the feasibility of the project and outlines the plans of the project.


Open 

Final
Paper

Final Paper containing more details about the project and sections mentioned above for Word Safari.


Open 

Project
Poster

Poster illustrating the overview of components developed within this project.


Open