How do I parse the result?

All possible result elements for the current extraction model “cvlizer_3_0” are described in the XML Schema Definition (XSD) file “jv_hr_3_0.xsd”, which can be downloaded from the resources section. The XSD file can be used to generate the class files or data structures needed to handle, or at least map, the extraction results automatically. What to generate, and how, depends on the tool and programming language you are using. The JDK, for instance, contains a tool called xjc that creates fully annotated Java class files; for C#, Visual Studio comes with the “xsd.exe” tool for the same purpose.
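If generating classes from the XSD is not an option, the result can also be walked generically. The following is a minimal Python sketch using only the standard library. The sample XML and its element names (“cv”, “personal”, “firstname”, “lastname”) are assumptions made for illustration; the actual element names are defined in “jv_hr_3_0.xsd”.

```python
import xml.etree.ElementTree as ET

# Hypothetical result fragment for illustration only; the real element
# names and structure are defined in jv_hr_3_0.xsd.
SAMPLE_RESULT = """<cv>
  <personal>
    <firstname>Anna</firstname>
    <lastname>Muster</lastname>
  </personal>
</cv>"""


def strip_namespace(tag: str) -> str:
    """Drop a leading '{namespace}' prefix that ElementTree puts on qualified tags."""
    return tag.rsplit("}", 1)[-1]


def element_to_dict(element: ET.Element):
    """Recursively convert an element into nested dicts; leaf elements become text."""
    children = list(element)
    if not children:
        return (element.text or "").strip()
    result = {}
    for child in children:
        key = strip_namespace(child.tag)
        value = element_to_dict(child)
        if key in result:
            # Repeated sibling elements are collected into a list.
            if not isinstance(result[key], list):
                result[key] = [result[key]]
            result[key].append(value)
        else:
            result[key] = value
    return result


root = ET.fromstring(SAMPLE_RESULT)
parsed = element_to_dict(root)
print(parsed["personal"]["lastname"])
```

A generic mapping like this ignores attributes and type information, so generated classes remain the more robust choice when the full schema needs to be handled.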

Please note that some result element values are given as abbreviations or codes, for example the ISO code “AT” for “Austria” as the country value. A ZIP archive containing CSV files with these codes and abbreviations can be downloaded from the resources section under “Domains”.
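Resolving such a code to a readable name could look like the Python sketch below. The CSV layout shown here (a semicolon delimiter and the column names “code” and “name”) is an assumption for illustration; check the files in the “Domains” archive for their actual delimiter and headers.

```python
import csv
import io

# Hypothetical CSV content; the real files in the "Domains" archive may use
# a different delimiter and different column names.
SAMPLE_COUNTRIES_CSV = "code;name\nAT;Austria\nDE;Germany\n"


def load_code_table(csv_text: str, delimiter: str = ";") -> dict:
    """Build a code -> name lookup table from CSV text with a header row."""
    reader = csv.DictReader(io.StringIO(csv_text), delimiter=delimiter)
    return {row["code"]: row["name"] for row in reader}


countries = load_code_table(SAMPLE_COUNTRIES_CSV)

# Fall back to the raw code when no mapping exists.
print(countries.get("AT", "AT"))
```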

Which input file formats are supported?

Our extraction services support the following plaintext and binary input formats, regardless of the applied semantic extraction mode:

All text processing formats and PDF may also contain scanned images, which will be detected and converted.

These file types are supported by the “categorize (REST/SOAP)” and “merge (REST) / mergeToXML (SOAP)” methods. In the case of embedded archive formats (e.g. attached messages or .zip files in mails), the contained files are parsed recursively.

Which languages are supported?

The JoinVision extraction services support the following output languages for structured information (codes):

This does not affect any non-transformed information, as JoinVision does not provide any kind of translation service. Non-transformed information remains in the original language of the provided document. Only data of type “codeNamePair”, or of a type inheriting from it, is provided in one of the languages stated above, based on the parameter passed to the web service.

The JoinVision extraction services support the following input languages (document languages):

What information is extracted?

The extracted fields depend on the extraction model used.

CVlizer extracts the following information from CVs (without attachments):

*Standardized values