Expectation Validation Result

Evaluates whether a batch of data matches expectations.

Actions

Validation Filter:

Overview
Expectation Suite: titanic.subdir_reader.Titanic.BasicDatasetProfiler
Status: Failed
Statistics
Evaluated Expectations
51
Successful Expectations
43
Unsuccessful Expectations
8
Success Percent
≈84.31%

Info
Great Expectations Version
0.9.0b1+395.gf0f20d3b.dirty
Run ID
test_run_id_12345
Batch Markers
ge_load_time
20200131T214156.516057Z
pandas_data_fingerprint
3aaabc12402f987ff006429a7756f5cf
Batch Kwargs
datasource
titanic
path
/private/var/folders/8_/yrr889196455tj9104lt6q900000gn/T/pytest-of-james/pytest-113/project_dir2/project_path/data/titanic/Titanic.csv
Table-Level Expectations
Status Expectation Observed Value
Must have more than 0 rows.
1,313
Must have a list of columns in a specific order, but that order is not specified.
['Unnamed: 0', 'Name', 'PClass', 'Age', 'Sex', 'Survived', 'SexCode']
Unnamed: 0
Status Expectation Observed Value
value types must belong to this set: BIGINT BYTEINT DECIMAL INT INTEGER IntegerType LongType SMALLINT TINYINT int.
int64
may have any number of unique values.
1,313
may have any fraction of unique values.
1
values must not be null, at least 50 % of the time.
100% not null
values must belong to this set: [ ].

1,313 unexpected values found. 100% of 1,313 total rows.
Unexpected Value Count
1
1
2
1
3
1
4
1
5
1
6
1
7
1
8
1
9
1
10
1
11
1
12
1
13
1
14
1
15
1
16
1
17
1
18
1
19
1
20
1
100% unexpected
values must be unique.
0% unexpected
Name
Status Expectation Observed Value
value types must belong to this set: CHAR StringType TEXT VARCHAR str string.
0% unexpected
may have any number of unique values.
1,310
may have any fraction of unique values.
≈0.9977151561
values must not be null, at least 50 % of the time.
100% not null
values must belong to this set: [ ].

1,313 unexpected values found. 100% of 1,313 total rows.
Unexpected Value Count
Carlsson, Mr Frans Olof
2
Connolly, Miss Kate
2
Kelly, Mr James
2
Allen, Miss Elisabeth Walton
1
Allison, Master Hudson Trevor
1
Allison, Miss Helen Loraine
1
Allison, Mr Hudson Joshua Creighton
1
Allison, Mrs Hudson JC (Bessie Waldo Daniels)
1
Anderson, Mr Harry
1
Andrews, Miss Kornelia Theodosia
1
Andrews, Mr Thomas, jr
1
Appleton, Mrs Edward Dale (Charlotte Lamson)
1
Artagaveytia, Mr Ramon
1
Astor, Colonel John Jacob
1
Astor, Mrs John Jacob (Madeleine Talmadge Force)
1
Aubert, Mrs Leontine Pauline
1
Barkworth, Mr Algernon H
1
Baumann, Mr John D
1
Baxter, Mr Quigg Edmond
1
Baxter, Mrs James (Helene DeLaudeniere Chaput)
1
100% unexpected
values must not match this regular expression: ^\s+|\s+$.

3 unexpected values found. ≈0.2285% of 1,313 total rows.
Unexpected Value Count
Barber, Ms
1
Daly, Mr Peter Denis
1
Geiger, Miss Emily
1
≈0.22848% unexpected
PClass
Status Expectation Observed Value
value types must belong to this set: CHAR StringType TEXT VARCHAR str string.
0% unexpected
may have any number of unique values.
4
may have any fraction of unique values.
≈0.003046458492
values must not be null, at least 50 % of the time.
100% not null
values must belong to this set: [ ].

1,313 unexpected values found. 100% of 1,313 total rows.
Unexpected Value Count
3rd
711
1st
322
2nd
279
*
1
100% unexpected
values must not match this regular expression: ^\s+|\s+$.
0% unexpected
distinct values must belong to a set, but that set is not specified.
['*', '1st', '2nd', '3rd']
Age
Status Expectation Observed Value
value types must belong to this set: DOUBLE_PRECISION DoubleType FLOAT FLOAT4 FLOAT8 FloatType NUMERIC float.
float64
may have any number of unique values.
75
may have any fraction of unique values.
≈0.09920634921
values must not be null, at least 50 % of the time.
≈57.578% not null
values must belong to this set: [ ].

756 unexpected values found. ≈57.58% of 1,313 total rows.
Unexpected Value Count
22.0
35
21.0
31
30.0
31
18.0
30
36.0
29
24.0
27
26.0
27
27.0
24
28.0
24
19.0
23
20.0
23
23.0
23
25.0
23
32.0
22
45.0
20
29.0
15
31.0
14
33.0
14
35.0
14
39.0
14
≈57.578% unexpected
minimum value may have any numerical value.
0.17
maximum value may have any numerical value.
71
mean may have any numerical value.
≈30.39798942
median may have any numerical value.
28
quantiles must be within the following value ranges.
Quantile Min Value Max Value
0.05
Any
Any
Q1
Any
Any
Median
Any
Any
Q3
Any
Any
0.95
Any
Any
Quantile Value
0.05
6.0
Q1
21.0
Median
28.0
Q3
39.0
0.95
57.0
can match any distribution.
KL Divergence: None (-infinity, infinity, or NaN)
Sex
Status Expectation Observed Value
value types must belong to this set: CHAR StringType TEXT VARCHAR str string.
0% unexpected
may have any number of unique values.
2
may have any fraction of unique values.
≈0.001523229246
values must not be null, at least 50 % of the time.
100% not null
values must belong to this set: [ ].

1,313 unexpected values found. 100% of 1,313 total rows.
Unexpected Value Count
male
851
female
462
100% unexpected
values must not match this regular expression: ^\s+|\s+$.
0% unexpected
distinct values must belong to a set, but that set is not specified.
['female', 'male']
Survived
Status Expectation Observed Value
value types must belong to this set: BIGINT BYTEINT DECIMAL INT INTEGER IntegerType LongType SMALLINT TINYINT int.
int64
may have any number of unique values.
2
may have any fraction of unique values.
≈0.001523229246
values must not be null, at least 50 % of the time.
100% not null
values must belong to this set: [ ].

1,313 unexpected values found. 100% of 1,313 total rows.
Unexpected Value Count
null
863
1
450
100% unexpected
distinct values must belong to a set, but that set is not specified.
[0, 1]
SexCode
Status Expectation Observed Value
value types must belong to this set: BIGINT BYTEINT DECIMAL INT INTEGER IntegerType LongType SMALLINT TINYINT int.
int64
may have any number of unique values.
2
may have any fraction of unique values.
≈0.001523229246
values must not be null, at least 50 % of the time.
100% not null
values must belong to this set: [ ].

1,313 unexpected values found. 100% of 1,313 total rows.
Unexpected Value Count
null
851
1
462
100% unexpected
distinct values must belong to a set, but that set is not specified.
[0, 1]

Expectation Validation Result

Evaluates whether a batch of data matches expectations.