Backup Via API

Is there any way to run hourly backups using the API? Has anyone ever done this?

Do you mean a backup of your Pipedrive data?
If you do, I would ask why you want to back it up in the first place (we already do that for you).

But if for some reason you want to have the data duplicated and in sync somewhere, you can do it.
You should just make sure you don’t exceed the rate limit.

And using webhooks you can receive the updated data without affecting the rate limit.
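
For example, here is a rough sketch of a webhook receiver, assuming Node with Express (the route, port, and file layout are placeholders for illustration, not anything Pipedrive-specific):

const express = require("express");
const fs = require("fs");

const app = express();
app.use(express.json());
fs.mkdirSync("backups", { recursive: true });

// Pipedrive POSTs a JSON payload whenever a watched object changes;
// the payload is stored as-is so nothing is lost if its fields change.
app.post("/pipedrive-webhook", async (req, res) => {
  const file = `backups/webhook-${Date.now()}.json`;
  await fs.promises.writeFile(file, JSON.stringify(req.body));
  res.sendStatus(200); // acknowledge quickly so Pipedrive does not retry
});

app.listen(3000);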

Dani, can you give my developer Dhruv and me step-by-step instructions to go from the current daily backups to hourly backups? Also, would once per hour exceed current rate limits?

I’m sorry, Adam. I’m not really sure what you mean. But let’s see if I can help.

What’s your end goal?

If you’re talking about the backups that we do automatically, you can’t change the time interval on those…

And if you mean keeping your data in sync, if you use webhooks you can have it updated in realtime without affecting the rate limit.

Dani, it’s very simple. All we want to do is to make an hourly backup of our Pipedrive instance, that’s it!

Hey Adam, there is no “backup all my Pipedrive data” API call; rather, you need to identify which items you wish to back up, then call the appropriate API endpoint for each. As long as you throttle calls (https://pipedrive.readme.io/docs/core-api-concepts-rate-limiting) you will not hit the rate limits.
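
A throttled export of, say, all organizations could look roughly like this (a sketch only, assuming Node 18+ with global fetch and v1 api_token authentication; adjust the delay to whatever the rate-limit doc above allows for your plan):

const fs = require("fs/promises");

const API_TOKEN = process.env.PIPEDRIVE_API_TOKEN;
const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

async function exportOrganizations() {
  const all = [];
  let start = 0;
  while (true) {
    // page through the organization list 100 records at a time
    const url = `https://api.pipedrive.com/v1/organizations?start=${start}&limit=100&api_token=${API_TOKEN}`;
    const body = await (await fetch(url)).json();
    all.push(...(body.data || []));
    const page = body.additional_data && body.additional_data.pagination;
    if (!page || !page.more_items_in_collection) break;
    start = page.next_start;
    await sleep(500); // space requests out so the rate limit is never hit
  }
  await fs.writeFile("organizations.json", JSON.stringify(all));
}

exportOrganizations();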

We sometimes use the API to extract all available data; if the account is large it can take many, many hours to extract everything, so it may not be practical to export via the API every hour. The “Recents” API endpoint lets you determine what changed recently for some objects, and so can be used to make the extraction more efficient. Feel free to PM me if you would like to hear how we do this.
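
As a rough illustration (not necessarily how Tim’s team does it), the Recents endpoint can be polled directly with plain HTTP, again assuming Node 18+ and an api_token; the since_timestamp would normally be the time of the previous run:

const API_TOKEN = process.env.PIPEDRIVE_API_TOKEN;

// Ask Pipedrive which organizations changed since the given UTC timestamp
// (other record types can be requested via the "items" parameter).
async function recentOrganizations(sinceTimestamp) {
  const url = "https://api.pipedrive.com/v1/recents" +
    `?since_timestamp=${encodeURIComponent(sinceTimestamp)}` +
    `&items=organization&api_token=${API_TOKEN}`;
  const body = await (await fetch(url)).json();
  return body.data || []; // each entry wraps the changed record in its "data" field
}

// e.g. run hourly, passing the time of the previous run
recentOrganizations("2019-08-14 11:23:12").then((changes) =>
  console.log(`${changes.length} organizations changed`)
);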


Tim, thanks very much. I will have Dhruv stay in touch with you as we build this.

Dhruv, let’s build this now, and I’ll ask Sina which bucket to put it in.

Can I ask, how do you do that?

Hey Andressa - we (Trujay) use purpose-built SaaS data integration and data migration products to handle these kinds of use cases. These simplify access to Pipedrive records by providing connectors that, for example, automatically rate-limit calls to the Pipedrive API. Since this question is about backing up, a relevant example of how these connectors work is to iterate through recently changed Pipedrive organizations and save each changed organization as a JSON document to a folder on Amazon S3:

// iterate over organizations that changed since the given timestamp
for await (const org of pd.query.Organizations({rowFilter: {byRecent: {since_timestamp: "2019-08-14 11:23:12"}}})){
  // save the changed org to amazon s3 as JSON file
  const orgAsJSONFile = ctx.utilities.fileProvider({string: JSON.stringify(org.data)});
  await s3.upsert.Files([{Key: "/backups/" + org.data.id + ".json", File: orgAsJSONFile}]);
}

Under the hood, this uses Pipedrive’s “Recents” API to efficiently extract only records changed since the last run (the “since_timestamp” above would be a parameter). It is worth noting that not all Pipedrive record types expose data via the “Recents” API, so extracting those records may take quite some time (e.g. many hours).
