They cause postgresql to crash because it does not allow them in text fields.
They are seemingly only present accidentally in source documents;
so stripping them does not really impact the quality of metadata.
Resolves T4277.
Differential D8229
storage: Strip null characters from metadata documents Authored by vlorentz on Aug 10 2022, 10:45 AM.
Details
They cause postgresql to crash because it does not allow them in text fields. They are seemingly only present accidentally in source documents; Resolves T4277.
Diff Detail
Event TimelineComment Actions Build is green Patch application report for D8229 (id=29685)Rebasing onto 5313be86b3... Current branch diff-target is up to date. Changes applied before testcommit bb9082a6e5b95085ada61a917a2547f7d0a5c5e2
Author: Valentin Lorentz <vlorentz@softwareheritage.org>
Date: Wed Aug 10 10:44:31 2022 +0200
storage: Strip null characters from metadata documents
They cause postgresql to crash because it does not allow them in text fields.
They are seemingly only present accidentally in source documents;
so stripping them does not really impact the quality of metadata.See https://jenkins.softwareheritage.org/job/DCIDX/job/tests-on-diff/413/ for more details.
| ||||||||||||||||||||||||||||||||