Title: | Mini Games from Adventures of Beta and Bit |
---|---|
Description: | Three games: proton, frequon and regression. Each one is a console-based data-crunching game for younger and older data scientists. Act as a data-hacker and find Slawomir Pietraszko's credentials to the Proton server. In proton you have to solve four data-based puzzles to find the login and password. There are many ways to solve these puzzles. You may use loops, data filtering, ordering, aggregation or other tools. Only basics knowledge of R is required to play the game, yet the more functions you know, the more approaches you can try. In frequon you will help to perform statistical cryptanalytic attack on a corpus of ciphered messages. This time seven sub-tasks are pushing the bar much higher. Do you accept the challenge? In regression you will test your modeling skills in a series of eight sub-tasks. Try only if ANOVA is your close friend. It's a part of Beta and Bit project. You will find more about the Beta and Bit project at <https://github.com/BetaAndBit/Charts>. |
Authors: | Przemyslaw Biecek [aut, cre], Witold Chodor [trl], Katarzyna Fak [aut], Tomasz Zoltak [aut], Foundation SmarterPoland.pl [cph] |
Maintainer: | Przemyslaw Biecek <[email protected]> |
License: | GPL-2 |
Version: | 2.2 |
Built: | 2024-11-17 06:29:33 UTC |
Source: | CRAN |
The character vector of recently executed commands. Each element of the vector consists of command's name and command's arguments separated with a space.
data(bash_history)
data(bash_history)
a character vector with 19913 elements.
It was conducted firstly in the same time as PISA 2009 study, with use of the same cognitive tests and questionnaires as in PISA 2009, but on a different group of students: first grade students of upper-secondary schools (in Poland most of the students in a regular PISA sample attends lower-secondary schools). The students who participated in the first wave of the study were followed in the 2nd grade of upper-secondary school within the research program Our further study and work (Nasza Dalsza Nauka i Praca). Both studies were conducted by the Institute of Philosophy and Sociology Polish Academy of Sciences.
data frame: 3796 obs. of 54 variables
The original data was changed a little, to better fit the purpose of the game.
The dataset describing names, surnames and faculty employees' logins. Note that it is an artificial dataset that imitates real database. The subsequent columns in this dataset describe:
name. The name of an employee.
surname. The surname of an employee.
login. The login of an employee on the Proton server.
a data frame with 541 rows and three columns.
The vector with frequencies of 26 English letters. It is sorted by the frequency of usage. May be used to refine the transliteraton.
data(EnglishLetterFrequency)
data(EnglishLetterFrequency)
a named vector with 26 elements.
The Dutch institute Rijksinstituut voor Volksgezondheid en Milieu has compiled a database with ingredient information on more than 2,200 food products. The data in this package is only a processed fraction of the huge and very interesting NEVO database available at https://www.rivm.nl/documenten/nevo-online-versie.
data frame: 2207 obs. of 9 variables
The preprocessed data can be used to reproduce the charts from the book Wykresy od kuchni (Chart runners) https://github.com/BetaAndBit/Charts
Note that data frames food
, food_max
, food_mini
, food_all
have product names in English, while
food_pl
, food_max_pl
, food_mini_pl
, food_all_pl
have product names in Polish.
library("ggplot2") head(food) library("ggthemes") ggplot(data = food, aes(x = Energy)) + geom_histogram(color = "white") + facet_wrap(~Group) + labs(title = "Energy value of the products", subtitle = "per 100 g", x = "Energy value", y = "Number") + theme_economist() ggplot(data = food_mini, aes(x = Energy)) + geom_histogram(color = "white") + facet_wrap(~Group) + labs(title = "Energy value of the products", subtitle = "per 100 g", x = "Energy value", y = "Number") + theme_economist() ggplot(data = food, aes(x = Protein, y = Fats, color = Group, size = Energy)) + geom_point() + scale_color_brewer(type = "qual", palette = "Dark2") + labs(title = "Share of protein and fats", subtitle = "per 100 g", y = "Fats [g]", x = "Protein [g]") + theme_gdocs() ggplot(data = food, aes(x = Group, y = Energy)) + geom_rug(sides = "l") + geom_violin(scale = "width", aes(fill = Group)) + geom_text(data = food_max, aes(label = Name), hjust = 0, vjust = 0, color = "blue4") + geom_boxplot(width = 0.2, coef = 100) + coord_flip() + labs(title = "Energy value distribution", subtitle = "per 100 g") + theme_gdocs() + theme(legend.position = "none")
library("ggplot2") head(food) library("ggthemes") ggplot(data = food, aes(x = Energy)) + geom_histogram(color = "white") + facet_wrap(~Group) + labs(title = "Energy value of the products", subtitle = "per 100 g", x = "Energy value", y = "Number") + theme_economist() ggplot(data = food_mini, aes(x = Energy)) + geom_histogram(color = "white") + facet_wrap(~Group) + labs(title = "Energy value of the products", subtitle = "per 100 g", x = "Energy value", y = "Number") + theme_economist() ggplot(data = food, aes(x = Protein, y = Fats, color = Group, size = Energy)) + geom_point() + scale_color_brewer(type = "qual", palette = "Dark2") + labs(title = "Share of protein and fats", subtitle = "per 100 g", y = "Fats [g]", x = "Protein [g]") + theme_gdocs() ggplot(data = food, aes(x = Group, y = Energy)) + geom_rug(sides = "l") + geom_violin(scale = "width", aes(fill = Group)) + geom_text(data = food_max, aes(label = Name), hjust = 0, vjust = 0, color = "blue4") + geom_boxplot(width = 0.2, coef = 100) + coord_flip() + labs(title = "Energy value distribution", subtitle = "per 100 g") + theme_gdocs() + theme(legend.position = "none")
The frequon
function is used for solving problems in the data-based game ,,The Frequon Game”.
frequon(...)
frequon(...)
... |
|
Every time when some additional hints are needed one should add
hint=TRUE
argument to the frequon
function.
In this game you are in contact with a group of people that are going to stop terrorists.
You can communicate with them through frequon
function.
In each call add subject
parameter that will indicate which message you are answering.
Add content
parameter. It's value should match the request.
,,The Frequon Game” is a free of charge, educational project of the SmarterPoland.pl Foundation.
Katarzyna Fak - the idea and the implementation,
Przemyslaw Biecek - comments and the integration with the 'BetaBit' package.
frequon() frequon(hint=TRUE)
frequon() frequon(hint=TRUE)
It was conducted firstly in the same time as PISA 2009 study, with use of the same cognitive tests and questionnaires as in PISA 2009, but on a different group of students: first grade students of upper-secondary schools (in Poland most of the students in a regular PISA sample attends lower-secondary schools). The students who participated in the first wave of the study were followed in the 2nd grade of upper-secondary school within the research program Our further study and work (Nasza Dalsza Nauka i Praca). Both studies were conducted by the Institute of Philosophy and Sociology Polish Academy of Sciences.
data frame: 3796 obs. of 54 variables
The original data was changed a little, to better fit the purpose of the game.
The messages to be decoded in the game 'frequon()'. How to access it? You have to figure out this by yourself.
The dataset describing the history of logs: who, from where and when logged into the Proton server. The subsequent columns in this dataset describe:
login. The login of the user which logs into the Proton server.
host. The IP address of the computer, from which the log into the Proton server was detected.
date. The date of log into the Proton server. Rows are sorted by this column.
data(logs)
data(logs)
a data frame with 59366 rows and 3 columns.
The proton
function is used for solving problems in the data-based game ,,The Proton Game”.
Solve four data-based puzzles in order to crack into Pietraszko's account!
proton(...)
proton(...)
... |
|
Every time when some additional hints are needed one should add
hint=TRUE
argument to the proton
function.
In order to get more information about a user on the Proton server
one should pass action = "login"
, login="XYZ"
arguments
to the proton
function.
In order to log into the Proton server one should pass action = "login"
,
login="XYZ"
, password="ABC"
arguments to the proton
function.
If the password matches login, then one will receive a message about successful login.
In order to log into a server different from Proton one should pass
action = "server"
, host="XYZ"
arguments to the proton
function.
,,The Proton Game” is a free of charge, educational project of the SmarterPoland.pl Foundation.
Przemyslaw Biecek, [email protected], SmarterPoland.pl Foundation.
proton() proton(hint=TRUE)
proton() proton(hint=TRUE)
Reads data in GPX form. Examples of tourist routes saved in this format can be downloaded from mapa-turystyczna.pl.
read_gpx(path, name = NULL, uniform = TRUE, dx = 25, span = 0.1) ## S3 method for class 'gpx_file' plot(x, ..., type = "profile", color = "magenta")
read_gpx(path, name = NULL, uniform = TRUE, dx = 25, span = 0.1) ## S3 method for class 'gpx_file' plot(x, ..., type = "profile", color = "magenta")
path |
path the the gpx file with information about the route |
name |
name of the route |
uniform |
if TRUE then route will be converted into a uniform grid of points |
dx |
if uniform is TRUE then dx is the grid size |
span |
if uniform is TRUE then span is smoothing parameter |
x |
routs to be plotted |
... |
other parameters |
type |
what should be plotted? 'profile' for profiles, 'difference' for derivative, 'boxplot' absolute derivative |
color |
names of colors for lines |
Przemyslaw Biecek
The regression
function is used for solving problems in the data-based
game ,,The regression Game”.
regression(...)
regression(...)
... |
|
Every time when some additional hints are needed one should add
hint = TRUE
or techHint = TRUE
argument to the regression
function.
Technical hints will point out R packages and/or functions which might help
you to solve the task while "normal" hints provide you with methodological
advices.
In this game you are helping Professor Pearson.
You can communicate with him through the regression
function.
In each call include the subject
parameter (indicating which task
you are trying to answer) and the content
parameter (providing
information Professor Pearson is asking you for in a given task).
Data used in the game comes from the study of Polish upper-secondary schools first grade students. It was conducted together with the PISA 2009 study using the same cognitive tests and questionnaires as in PISA 2009 but on a different group of students (in Poland most of the students in a PISA sample attends lower-secondary schools). The students who participated in the first wave of the study were followed in the 2nd grade of upper-secondary school within the research program Our further study and work (Nasza Dalsza Nauka i Praca). Both studies were conducted by the Institute of Philosophy and Sociology Polish Academy of Sciences. The original data was changed a little, to better fit the purpose of the game.
,,The Regression Game” is a free of charge, educational project of the SmarterPoland.pl Foundation.
Function returns one of three possible values:
TRUE
if you provided correct answer to a task,
FALSE
if you provided wrong answer to a task,
NULL
if function can't identify task you wanted to answer.
Tomasz Zoltak - the idea and the implementation,
Mateusz Zoltak - comments, contribution to hints,
Zuzanna Brzozowska - proofreading,
Przemyslaw Biecek - comments and the integration with the 'BetaBit' package.
regression() regression(hint = TRUE) regression(techHint = TRUE)
regression() regression(hint = TRUE) regression(techHint = TRUE)
The character vector of 1000 most commonly used passwords. It is sorted by the frequency of password's usage. First passwords in the vector are the most frequently used.
a character vector with 1000 elements.
The character vector of 100 most commonly used words in English. It is sorted by the frequency of usage. May be used to refine the transliteraton.
data(top100commonWords)
data(top100commonWords)
a character vector with 100 elements.
dataDNiP
and
DNiP
datasets.The data frame containng labels of the variables from dataDNiP
and
DNiP
datasets.
data frame: 54 obs. of 2 variables
The named list with 18 languages. Based on <https://wikiquote.org/>.
data(wikiquotes)
data(wikiquotes)
a named list with 18 elements.