Knowledgebase:
Manual removal of the Content Processing Framework (CPF) from a database
14 September 2017 09:00 AM

Introduction

When CPF is installed, a number of new documents are created for the nominated Triggers database associated with that database.

This Knowledgebase article is designed to show you what CPF creates on install, in the event that you want to safely disable and remove it from your system.

Getting started

Below is a layout of all databases and their associated document counts with a clean install of MarkLogic 9.0-2:

Database IDDatabase NameDocument Count
8723423541597683063 App-Services 14
12316032390759111212 Modules 0
1695527226691932315 Fab 0
11723073009075196192 Security 1526
15818912922008798974 Triggers 0
5212638700134402198 Documents 0
4320540002505594119 Extensions 0
9023394855382775954 Last-Login 0
11598847197347642387 Schemas 0
12603105430027950215 Meters 48

Adding CPF

After installing CPF on the Documents database (with conversion enabled), we now see:

Database IDDatabase NameDocument Count
8723423541597683063 App-Services 15
12316032390759111212 Modules 0
1695527226691932315 Fab 0
11723073009075196192 Security 1526
15818912922008798974 Triggers 39
5212638700134402198 Documents 0
4320540002505594119 Extensions 0
9023394855382775954 Last-Login 0
11598847197347642387 Schemas 0
12603105430027950215 Meters 498

If we ignore Meters and App-Services, we can see that by default, A CPF install will create a number of documents in the Triggers database:

/cpf/domains.css
/cpf/pipelines.css
http://marklogic.com/cpf/configuration/configuration.xml
http://marklogic.com/cpf/domains/4361761515557042908.xml
http://marklogic.com/cpf/pipelines/10451885084298751684.xml
http://marklogic.com/cpf/pipelines/11486027894562997537.xml
http://marklogic.com/cpf/pipelines/1182872541253698578.xml
http://marklogic.com/cpf/pipelines/11925472395644624519.xml
http://marklogic.com/cpf/pipelines/12665626287133680551.xml
http://marklogic.com/cpf/pipelines/12977232154552215987.xml
http://marklogic.com/cpf/pipelines/13371411038103584886.xml
http://marklogic.com/cpf/pipelines/13468360248543629252.xml
http://marklogic.com/cpf/pipelines/13721894103731640519.xml
http://marklogic.com/cpf/pipelines/14473927355946353823.xml
http://marklogic.com/cpf/pipelines/16071401642383641119.xml
http://marklogic.com/cpf/pipelines/17008133204004114953.xml
http://marklogic.com/cpf/pipelines/1707825679528566193.xml
http://marklogic.com/cpf/pipelines/17486255598951175231.xml
http://marklogic.com/cpf/pipelines/1789191734187967847.xml
http://marklogic.com/cpf/pipelines/2145494300111008849.xml
http://marklogic.com/cpf/pipelines/2272288885870389220.xml
http://marklogic.com/cpf/pipelines/2585221667797881502.xml
http://marklogic.com/cpf/pipelines/4684095308382280821.xml
http://marklogic.com/cpf/pipelines/6055693256331806191.xml
http://marklogic.com/cpf/pipelines/7250675434061295808.xml
http://marklogic.com/cpf/pipelines/7354167915842037706.xml
http://marklogic.com/cpf/pipelines/7492839190910743342.xml
http://marklogic.com/cpf/pipelines/8329675320036351600.xml
http://marklogic.com/cpf/pipelines/8537493622930387355.xml
http://marklogic.com/cpf/pipelines/8877791654658876902.xml
http://marklogic.com/cpf/pipelines/8988716724908642408.xml
http://marklogic.com/cpf/pipelines/9432621469736814202.xml
http://marklogic.com/xdmp/triggers/10905847201437369653
http://marklogic.com/xdmp/triggers/11663386212502595308
http://marklogic.com/xdmp/triggers/12471659507809075185
http://marklogic.com/xdmp/triggers/15932603084768890631
http://marklogic.com/xdmp/triggers/16817738273312375366
http://marklogic.com/xdmp/triggers/17731123999892629453
http://marklogic.com/xdmp/triggers/6779751200800194600

Files created by CPF

http://marklogic.com/cpf/configuration

One of these files is the CPF configuration.xml file

http://marklogic.com/cpf/domains

One of these documents describes the default domain which is created when CPF is installed:

Default Documents
http://marklogic.com/cpf/pipelines

Of the 39 files created, we can see from the URI listing above that the majority (28) of these are prefaced with http://marklogic.com/cpf/pipelines. These files describe each of the standard conversion pipelines that ship with the server. These are:

Alerting
Alerting (spawn)
Calais Entity Enrichment Sample
Conversion Processing
Conversion Processing (Basic)
Data Harmony Enrichment Sample
DocBook Conversion
Document Filtering (Properties)
Document Filtering (XHTML)
Entity Enrichment
Flexible Replication
HTML Conversion
Janya Entity Enrichment Sample
MS Office Conversion
Office OpenXML Extract
PDF Conversion
PDF Conversion (Image Batching)
PDF Conversion (Page Layout with Reblocking)
PDF Conversion (Page Layout, Image Batching)
PDF Conversion (Page Layout)
PDF Conversion (Paged Text, No Rendering)
Schema Validation
SRA NetOwl Entity Enrichment Sample
Status Change Handling
Temis Entity Enrichment Sample
WordprocessingML Process
XHTML Conversion Processing
XInclude Processing
http://marklogic.com/xdmp/triggers

Seven of the files are triggers - all of which are namespaced with the cpf prefix:

cpf:any-property Default Documents
cpf:create Default Documents
cpf:delete Default Documents
cpf:restart
cpf:state Default Documents
cpf:status Default Documents
cpf:update Default Documents

Removing the core files created when CPF was initially installed will disable it from further functioning in your environment.

Scripting the removal of default CPF components

This GitHub gist demonstrates a method for removing CPF configuration from a given database - in the example below, the "Triggers" database is specfied: