Skip to main content

ConvertOfficeFormat

Description

Converts a Open Office compatable file, to a PDF or Docx format.

Tags

convert, docx, excel, office, openoffice, pdf, powerpoint, word

Properties

In the list below required Properties are shown with an asterisk (*). Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

Display NameAPI NameDefault ValueAllowable ValuesDescription
Service Location Strategy *Service Location StrategyDefault
  • Default
  • Custom
Determines how Service Locations are configured within this processor for the Office Conversion Service.
Custom Office Convert Service URL *Custom Office Convert Service URLThe Custom URL of the Datavolo Office Conversion Service.

This property is only considered if:
  • the property Service Location Strategy has a value of Custom
Communication Timeout *Communication Timeout60 secThe amount of time to wait for a response from the microservices before timing out.
Output Type *Output TypePDF
  • PDF
  • DOCX
  • XLSX
The type of file that should be output from the conversion.
Convert DOC to DOCX Before PDF Conversion *Convert DOC to DOCX Before PDF Conversiontrue
  • false
  • true
Enabling DOC to DOCX conversion requires an additional step before converting to PDF. Conversion to PDF from DOC often produces low quality results, so converting from DOC to DOCX and then to PDF yields better output rendering.

This property is only considered if:
  • the property Output Type has a value of PDF
Include Annotations *Include Annotationstrue
  • false
  • true
Include Notes as PDF Annotations.

This property is only considered if:
  • the property Output Type has a value of PDF

Dynamic Properties

This component does not support dynamic properties.

Relationships

NameDescription
comms.failureIf the processor is unable to communicate with one of the necessary services, the input FlowFile will be routed to this relationship.
failureIf the text of a FlowFile cannot be extracted for any reason, the input FlowFile will be routed to this relationship.
successThe text of the PDF is routed to the success relationship.

Reads Attributes

This processor does not read attributes.

Writes Attributes

This processor does not write attributes.

State Management

This component does not store state.

Restricted

This component is not restricted.

Input Requirement

This component requires an incoming relationship.

System Resource Considerations

This component does not specify system resource considerations.

See Also