(2008) Point-In-Time Recovery for Hadoop
The PIT recovery feature allows to make the state of a system consistent. This functionality allows to synchronize the Hadoop with the primary database. Perhaps the primary database has been rolled back to a certain moment and the archive is now in an inconsistent state with the primary database. This functionality analyzes the situation and chooses a timestamp to perform the rollback on an archive.
Before you can start to use these functionalities, you should enable Point-In-Time Recovery (PIT) in DataTiering Settings Tab.
Using the Task Editor
To synchronize the Hadoop with the primary database:
- Launch transaction /DVD/TH_EDIT, open the process /DVD/SM_CHL_CONS and hit F8.
- Create a new task with a new Run ID by clicking the button below (or by hitting F8 key).
- Name your Run ID.
- Now you are ready to run your PIT tasks.
If you look at the task hierarchy on the screen, you will see that the PIT recovery process contains 6 different subtasks divided in 2 groups:
- Analysis: This optional subtask analyzes what operations are required to make the archive consistent with the primary database.
- Execution: This subtask performs operations to make the archive consistent with the primary database.
For each subgroup there are three operations:
- Generate the analysis / synchronization tasks: You can insert the DataProviders you are testing as a parameter here and then execute a report which generates a task for each DataProvider.
- Analyze the DataProviders / Execute the synchronization process: Executes either the analysis tasks or the synchronization tasks that were previously generated on the DataProviders.
- Show the results of the analysis / synchronization: Displays a report detailing the operations that are completed on each DataProvider.
To delete data used for PIT recovery older than a certain timestamp, use report /DVD/SM_CHL_DELETE. The idea is schedule the execution of this job periodically to avoid growth of the database.
To clean the details of these tasks there is the report /DVD/SM_CHL_CLEAN_TASK which takes as input a set of Run IDs and cleans all information about them.
If for some reason you want to re-open and existing Run ID you can do it via transaction /DVD/TH_MONITOR and entering your Run ID (in our example "PIT_RUN") in the Task run ID field.