Skip to contents

Combine multiple data frames based on their common columns. That's the first step for preprocessing with the barn package. When printing it shows the characteristics of the combined datasets.

Usage

barn(..., nominal_sufix = "_cat", numeric_sufix = "_num")

# S3 method for class 'barn'
print(x, form_width = 30, ...)

Arguments

...

Extra arguments.

nominal_sufix

An optional string for dealing with nominal variables. Defaults to "_cat".

numeric_sufix

An optional string for dealing with numeric variables. Defaults to "_num".

x

An object of class "barn".

form_width

An integer specifying the minimum column width (in characters). Default is 30.

Value

A barn object containing the combined data frame, row counts,

Examples

full <- data.frame(id = 1:3, p1 = c("A", "B", "C"), p2 = 10:12, y = 1:3)
holdout <- data.frame(id = 4:5, p1 = c("D", "E"), p2 = 1:2)
original <- data.frame(id = 1:2, p1 = c("F", "G"), p2 = 3:4, y = 4:5)
print(barn(full, holdout, original))
#> 
#> ── Barn                           ──────────────────────────────────────────────
#> 
#> ── Settings                       
#> nominal_sufix: _cat
#> numeric_sufix: _num
#> 
#> 
#> ── Datasets                       
#> # The combined dataset: 7 x 3
#> # A tibble: 3 × 2
#>   dataset  row_count
#>   <chr>        <int>
#> 1 full             3
#> 2 holdout          2
#> 3 original         2
#> 
#> ── Exploratory Data Analysis      
#> # A tibble: 3 × 4
#>   variable type    unique missing
#>   <chr>    <chr>    <int>   <int>
#> 1 id       numeric      5       0
#> 2 p1       numeric      7       0
#> 3 p2       numeric      7       0