How to delete the repeated occurrences of a file in documentum if it exists more than once using dql query in java?

836 Views Asked by At

I want to delete repeated occurrence of a file in documentum leaving only one file if it exists more than once.

Following is my code.

import com.documentum.fc.client.IDfQuery;
import com.documentum.fc.client.IDfSession;
import com.documentum.fc.client.IDfSessionManager;
import com.documentum.fc.common.DfException;
import com.documentum.fc.common.DfId;
import com.documentum.fc.common.IDfLoginInfo;
import com.documentum.operations.IDfDeleteOperation;

public class CountFiles {

    // Documentum target repository where the files will be imported
    private static final String REPO_NAME = "rep";



    public static void main( String[] args ) throws Exception {

        try {
            String username = "user";
            String password = "pwd";

            System.out.println( "Starting to connect ..." );

            IDfSessionManager sessMgr = createSessionManager( );
            addIdentity( sessMgr, username, password);
            IDfSession sess = sessMgr.getSession(REPO_NAME );
            System.out.println( "Successfully connected to the server.");
            queryDocumentum(sess);


        } catch( Exception ex ) {
            System.out.println( ex );
            ex.printStackTrace( );
        }
    }


    private static void queryDocumentum(IDfSession sess) throws DfException {
    IDfQuery query = new DfQuery();
String queryStr= "select count(*) from dm_document where folder('/XXX/YYY', DESCEND) and object_name = 'abc.pdf' ";
    query.setDQL(queryStr);
    IDfCollection coll = query.execute(sess,IDfQuery.DF_EXEC_QUERY);
    while(coll.next())
    {
      int count = coll.getValueAt(0);
      if(count>1)
      {
       String qry = "delete dm_sysobject (all) objects where object_name='abc.pdf';";

        IDfQuery q= new DfQuery();
        query.setDQL(qry);
        IDfCollection col = query.execute(sess,IDfQuery.DF_EXEC_QUERY);


      }
    }
    coll.close();
}
/**
     * Creates a new session manager instance. The session manager does not have
     * any identities associated with it.
     *
     * @return a new session manager object.
     * @throws DfException
     */
    private static IDfSessionManager createSessionManager( ) 
            throws Exception {
        IDfClientX clientX = new DfClientX( );
        IDfClient localClient = clientX.getLocalClient( );
        IDfSessionManager sessMgr = localClient.newSessionManager( );

        System.out.println( "Created session manager." );

        return sessMgr;
    }

    /**
     * Adds a new identity to the session manager.
     *
     */
    private static void addIdentity( final IDfSessionManager sm, 
            final String username, final String password ) 
            throws Exception {
        IDfClientX clientX = new DfClientX( );

        IDfLoginInfo li = clientX.getLoginInfo( );
        li.setUser( username );
        li.setPassword( password );

        // check if session manager already has an identity.
        // if yes, remove it.
        if( sm.hasIdentity( REPO_NAME ) ) {
            sm.clearIdentity( REPO_NAME );

            System.out.println( "Cleared identity on :" + REPO_NAME );
        }

        sm.setIdentity( REPO_NAME, li );

        System.out.println( "Set up identity for the user." );
    }

}

But something is wrong in the way that I am doing the operation. It is not working.I am not giving the path of the file here because I don't know the exact path of the file.Without giving the path of the file is it possible to delete all the occurrences of the file except one.

2

There are 2 best solutions below

0
Henning Winter On

If you are coding the logic in Java anyway have a look at the IDfOperatiins. Also this method for doing bulk stuff is described well in the guides.

0
Emilian Utma On

I would suggest the following changes in your DQL and in the logic, this is mostly because you are using DFC API:

DQL:

//Retrieves all the root object ids of the documents version tree where there are duplicates as their name. 
//Here I thought the latest created document is the one to keep, you can adapt it to your needs - **see ORDER by clause**.

    select i_chronicle_id from dm_document where folder('/XXX/YYYY', DESCEND) and object_name in (select object_name from dm_document where folder('/XXX/YYYY', DESCEND) and object_name = 'abc.pdf' group by object_name having count(*) > 1) order by r_creation_date asc;

Logic:

//"pseudo-code" - deletes the entire version tree of the document that is not the last in the returned list
while 'not last i_chronicle_id from the result collection' 
     //execute this dql: 
     `delete dm_document (all) objects where i_chronicle_id='<i_chronicle_id from the list>';`

Hope this helps,

Emilian

P.S. The DQLs I have tested against a CS 7.3