Commit 67e36d9

Add file size validation check for noark53 and noark54
1 parent 3ed6c97 commit 67e36d9

2 files changed: +58 -8 lines changed

noark-extraction-validator/src/main/resources/noark5/noark53/noark53-validation.xml

Lines changed: 29 additions & 4 deletions
@@ -1994,6 +1994,31 @@
 </test>
 
 <test id="AST18">
+<title>Document object file sizes</title>
+<description>
+Tests whether the document object file sizes specified in arkivstruktur.xml
+match the ones with the physical files on the file system.
+</description>
+<group>arkivstruktur</group>
+<queries>
+<info>
+<![CDATA[
+SELECT COUNT(*) documents_with_mismatched_file_sizes
+FROM arkivstruktur.dokumentobjekt
+WHERE filstoerrelse <> _detected_file_size;
+]]>
+</info>
+<errors>
+<![CDATA[
+SELECT referansedokumentfil document, filstoerrelse recorded_document_file_size, _detected_file_size detected_file_size
+FROM arkivstruktur.dokumentobjekt
+WHERE filstoerrelse <> _detected_file_size;
+]]>
+</errors>
+</queries>
+</test>
+
+<test id="AST19">
 <title>Correspondence parties</title>
 <description>
 Provides information about the number of correspondence parties grouped
@@ -2022,7 +2047,7 @@
 </queries>
 </test>
 
-<test id="AST19">
+<test id="AST20">
 <title>Screenings</title>
 <description>
 Provides information about the number of screened series, classes,
@@ -2157,7 +2182,7 @@
 </queries>
 </test>
 
-<test id="AST20">
+<test id="AST21">
 <title>Disposal decisions</title>
 <description>
 Provides information about the number of disposal decisions related to series, classes,
@@ -2292,7 +2317,7 @@
 </queries>
 </test>
 
-<test id="AST21">
+<test id="AST22">
 <title>Disposals</title>
 <description>
 Provides information about the number of disposals of series
@@ -2377,7 +2402,7 @@
 </queries>
 </test>
 
-<test id="AST22">
+<test id="AST23">
 <title>Personal name fields</title>
 <description>
 Checks whether all name fields contain seemingly valid personal names. The regular expressions used for

noark-extraction-validator/src/main/resources/noark5/noark54/noark54-validation.xml

Lines changed: 29 additions & 4 deletions
@@ -1994,6 +1994,31 @@
 </test>
 
 <test id="AST18">
+<title>Document object file sizes</title>
+<description>
+Tests whether the document object file sizes specified in arkivstruktur.xml
+match the ones with the physical files on the file system.
+</description>
+<group>arkivstruktur</group>
+<queries>
+<info>
+<![CDATA[
+SELECT COUNT(*) documents_with_mismatched_file_sizes
+FROM arkivstruktur.dokumentobjekt
+WHERE filstoerrelse <> _detected_file_size;
+]]>
+</info>
+<errors>
+<![CDATA[
+SELECT referansedokumentfil document, filstoerrelse recorded_document_file_size, _detected_file_size detected_file_size
+FROM arkivstruktur.dokumentobjekt
+WHERE filstoerrelse <> _detected_file_size;
+]]>
+</errors>
+</queries>
+</test>
+
+<test id="AST19">
 <title>Correspondence parties</title>
 <description>
 Provides information about the number of correspondence parties grouped
@@ -2022,7 +2047,7 @@
 </queries>
 </test>
 
-<test id="AST19">
+<test id="AST20">
 <title>Screenings</title>
 <description>
 Provides information about the number of screened series, classes,
@@ -2157,7 +2182,7 @@
 </queries>
 </test>
 
-<test id="AST20">
+<test id="AST21">
 <title>Disposal decisions</title>
 <description>
 Provides information about the number of disposal decisions related to series, classes,
@@ -2292,7 +2317,7 @@
 </queries>
 </test>
 
-<test id="AST21">
+<test id="AST22">
 <title>Disposals</title>
 <description>
 Provides information about the number of disposals of series
@@ -2377,7 +2402,7 @@
 </queries>
 </test>
 
-<test id="AST22">
+<test id="AST23">
 <title>Personal name fields</title>
 <description>
 Checks whether all name fields contain seemingly valid personal names. The regular expressions used for
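
The AST18 queries added to both files can be tried outside the validator. The following is a minimal sketch, not the validator's own code: it uses Python's built-in sqlite3 module as a stand-in database, attaches an in-memory schema named arkivstruktur, and fills a hypothetical dokumentobjekt table with three made-up rows. The column types and sample values are assumptions; only the two SQL statements are taken from the commit.

```python
import sqlite3

# SQL copied from the info and errors queries of test AST18 in this commit.
INFO_SQL = """
SELECT COUNT(*) documents_with_mismatched_file_sizes
FROM arkivstruktur.dokumentobjekt
WHERE filstoerrelse <> _detected_file_size;
""".strip()

ERRORS_SQL = """
SELECT referansedokumentfil document, filstoerrelse recorded_document_file_size, _detected_file_size detected_file_size
FROM arkivstruktur.dokumentobjekt
WHERE filstoerrelse <> _detected_file_size;
""".strip()


def build_demo_db():
    """Create an in-memory database with a hypothetical dokumentobjekt table."""
    conn = sqlite3.connect(":memory:")
    # Attach a second in-memory database so the schema-qualified name resolves.
    conn.execute("ATTACH DATABASE ':memory:' AS arkivstruktur")
    # Assumed column types; the real schema is not shown in the commit.
    conn.execute(
        "CREATE TABLE arkivstruktur.dokumentobjekt ("
        "referansedokumentfil TEXT, "
        "filstoerrelse INTEGER, "
        "_detected_file_size INTEGER)"
    )
    conn.executemany(
        "INSERT INTO arkivstruktur.dokumentobjekt VALUES (?, ?, ?)",
        [
            ("dokumenter/a.pdf", 1024, 1024),  # sizes agree
            ("dokumenter/b.pdf", 2048, 2000),  # recorded size differs from detected size
            ("dokumenter/c.pdf", 512, None),   # no detected size (NULL)
        ],
    )
    return conn


if __name__ == "__main__":
    conn = build_demo_db()
    print(conn.execute(INFO_SQL).fetchone())
    for row in conn.execute(ERRORS_SQL):
        print(row)
```

With this sample data the info query counts one mismatch and the errors query lists dokumenter/b.pdf. In SQLite, as in standard SQL, a comparison against NULL is not true, so the row without a detected size is not reported; whether the validator ever stores NULL in _detected_file_size is not stated in the commit.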
