The goal of this proposal is to digitize similar data from the Vietnam era to facilitate the processing of health claims by the VA. While data on a wide variety of combat events have been declassified and are available in text-format through the National Archives, these files cannot currently be used by the OMAR Dashboard or similar systems. The project will digitize these extant files to produce usable files (i.e. Excel format) to be linked to the OMAR Dashboard or similar systems…
The following tasks will be performed to meet the objectives listed above.
- Convert each of the fourteen record types from text-format to Excel format.
- Clean the extracted files to ensure that their content is usable (include headers, remove problematic unicode text, enforce uniformity in formatting within columns and across files).
- Extract relevant information (time, location, and type of event) from each of the twelve combat incident files into a single master Excel sheet to replicate the SIGACTs data available for Iraq and Afghanistan…