Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | |||
cluster:scoring:data_flow [2024/10/27 20:28] – benedikt | cluster:scoring:data_flow [2024/10/27 20:29] (current) – benedikt | ||
---|---|---|---|
Line 6: | Line 6: | ||
There are 3 general rules that we try to stick to. | There are 3 general rules that we try to stick to. | ||
+ | - The scorer/user **never** works directly with the 'raw copy' or backup copy, but rather the data being worked with should be copied to the workspace folder. | ||
+ | - Datasets should be ' | ||
+ | - We always keep a 'raw copy' of the data, in the state that we collected it/received it. We try to maintain backups of each 'stage of the process the data goes through' | ||
- | |||
- | - The scorer/user **never** works directly with the 'raw copy' or backup copy, but rather the data being worked with should be copied to the workspace folder. | ||
- | - Datasets should be ' | ||
- | - We always keep a 'raw copy' of the data, in the state that we collected it/received it. We try to maintain backups of each 'stage of the process the data goes through' | ||
These 3 rules, if followed consistently, | These 3 rules, if followed consistently, |