Discrepancy in data retrieved from the API

Hi!

I am trying to create this report in Google Sheets, pulling the data from the API.

Right now, I fetch every activity for each user from the beginning and then filter on update time. I am seeing a discrepancy in the data, especially in the number of calls. The numbers of tasks and meetings seem fine.
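One way to check whether the choice of date field explains the gap is to count the same rows twice, once per field. The following is only a minimal sketch: the field names `type`, `due_date` and `update_time` are assumed from the fields mentioned in this thread, and the sample rows are made up for illustration.

```python
from collections import Counter
from datetime import date, datetime


def count_by_type(activities, date_field, start, end):
    """Count activities per type whose `date_field` falls in [start, end]."""
    counts = Counter()
    for activity in activities:
        raw = activity.get(date_field)
        if not raw:
            continue  # skip rows where the field is empty
        try:
            # Only the leading YYYY-MM-DD part is compared, so both
            # "2019-08-03" and "2019-08-03 14:05:00" are accepted.
            day = datetime.strptime(raw[:10], "%Y-%m-%d").date()
        except ValueError:
            continue  # skip rows with an unparseable date
        if start <= day <= end:
            counts[activity.get("type", "unknown")] += 1
    return counts


# Made-up sample rows (assumed field names, not real account data).
sample = [
    {"type": "call", "due_date": "2019-08-02", "update_time": "2019-09-03 10:00:00"},
    {"type": "call", "due_date": "2019-08-05", "update_time": "2019-08-05 09:30:00"},
    {"type": "meeting", "due_date": "2019-08-06", "update_time": "2019-08-06 16:00:00"},
]

week = (date(2019, 8, 1), date(2019, 8, 7))
by_due = count_by_type(sample, "due_date", *week)
by_update = count_by_type(sample, "update_time", *week)
print(dict(by_due))
print(dict(by_update))
```

In this made-up sample, a call that was due in the first week of August but edited in September is counted under `due_date` and dropped under `update_time`, which is the kind of gap worth ruling out first.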

You can see here that the numbers for calls are far too low.

Is there an error I am making?

Let me know if there is anything I can do to get the numbers right.

Thanks

Hello,

One thing I’m noticing right off the bat is that the dates don’t seem to match in the examples. In the PD UI, you’ve got August 1-7, and the extracted data seems to be from September 3rd (the date it was gathered, I assume).

Are you sure you’re matching the dates (and any other filters)?

Hi David,

The date in the second screenshot is just when the output was printed, it is not related to the data.

Also, can you please tell me what the start_date and end_date parameters of the API represent in the actual data? I can see a number of dates, for example due date, update time and completed time.

Thanks

Hey @Analytics_ML,

Sorry for the wait, I was on vacation. As for your question about start/end, this has to do with when an activity is scheduled and its duration (the start and end of an activity).

As for the results, do you have something you can share that directly shows the discrepancy and how you’re filtering both sets of results?

I am trying to fetch two datasets using the Pipedrive API.

The first dataset comes from the following URL and returns 10,000+ rows:
https://api.pipedrive.com/v1/activities?&start_date=2019-08-2018

The second dataset returns only 3,000 rows, in response to the following URL:
https://api.pipedrive.com/v1/activities?&start_date=2019-08-2018&end_date=2019-08-2018

Is there a problem with the end_date? I am paginating correctly for both datasets. It’s the same code, just different URLs. Can you tell me what the problem might be?
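For reference, the pagination loop can be sketched as below. This is a sketch under assumptions, not a confirmed implementation: it assumes each page's JSON carries `additional_data.pagination` with `more_items_in_collection` and `next_start` keys, which should be checked against the API reference. `fetch_page` is a hypothetical callable standing in for the HTTP request, and `fake_fetch_page` is a stub so the loop runs offline.

```python
def fetch_all(fetch_page, limit=100):
    """Collect every row by following the pagination cursor.

    `fetch_page(start, limit)` is a hypothetical callable that returns the
    decoded JSON body of one page.
    """
    rows = []
    start = 0
    while True:
        body = fetch_page(start, limit)
        # "data" may be null/None on an empty page, hence the `or []`.
        rows.extend(body.get("data") or [])
        pagination = (body.get("additional_data") or {}).get("pagination") or {}
        if not pagination.get("more_items_in_collection"):
            return rows
        start = pagination["next_start"]


def fake_fetch_page(start, limit, total=250):
    """Offline stand-in for the HTTP call, using the assumed response shape."""
    more = start + limit < total
    pagination = {"start": start, "limit": limit, "more_items_in_collection": more}
    if more:
        pagination["next_start"] = start + limit
    data = [{"id": i} for i in range(start, min(start + limit, total))]
    return {"success": True, "data": data,
            "additional_data": {"pagination": pagination}}


all_rows = fetch_all(fake_fetch_page, limit=100)
print(len(all_rows))
```

Running both URLs through the same loop and comparing the collected `id` values, not just the row counts, shows exactly which rows the second request leaves out.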

Sorry, I just looked at what I wrote to you before and see that it was wrong as I misread what you had originally written.

So the start_date and end_date here are just the date range you wish to search within (according to due_date).

The date format should be YYYY-MM-DD, though, and it looks like you’re using YYYY-MM-YYYY instead.

Have you tried YYYY-MM-DD?
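A small check before the request is sent can catch this kind of format slip. The sketch below validates the YYYY-MM-DD format and builds the query string; the helper names `is_iso_date` and `activities_url` are made up for illustration, and authentication is deliberately left out because the URLs in this thread do not show it.

```python
from datetime import datetime
from urllib.parse import urlencode

BASE_URL = "https://api.pipedrive.com/v1/activities"


def is_iso_date(value):
    """Return True only for a real calendar date written as YYYY-MM-DD."""
    try:
        parsed = datetime.strptime(value, "%Y-%m-%d")
    except ValueError:
        return False
    # Round-trip to reject non-padded input such as "2019-8-1".
    return parsed.strftime("%Y-%m-%d") == value


def activities_url(start_date, end_date=None):
    """Build the activities URL, refusing badly formatted dates."""
    params = {"start_date": start_date}
    if end_date is not None:
        params["end_date"] = end_date
    for name, value in params.items():
        if not is_iso_date(value):
            raise ValueError(f"{name}={value!r} is not in YYYY-MM-DD format")
    return f"{BASE_URL}?{urlencode(params)}"


print(is_iso_date("2019-08-2018"))
print(activities_url("2018-08-01", "2019-09-16"))
```

Building the query with `urlencode` also avoids the stray `?&` at the start of a hand-typed query string.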

Sorry, there was a typo in my URLs.

The first dataset comes from the following URL and returns 10,000+ rows:
https://api.pipedrive.com/v1/activities?&start_date=2018-08-01

The second dataset returns only 3,000 rows, in response to the following URL:
https://api.pipedrive.com/v1/activities?&start_date=2018-08-01&end_date=2019-09-16

What could be the reason behind this significant difference in the number of rows?

Does your account have several activities that haven’t been marked done?
That could be a reason for the number difference.

There are some activities that have not been marked as done, but there are fewer than 1,000 of them.

The difference is way more than this.

I’m still investigating, but are you positive there aren’t activities with future due_dates, outside the timeframe, that aren’t being picked up?
I’m testing and can’t seem to replicate on my end.
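To narrow this down on the account itself, the two downloaded datasets can be compared row by row. The sketch below is only an illustration: it assumes each row carries an `id` and a `due_date` in YYYY-MM-DD form, the function name `explain_missing` is made up, and the sample rows are invented. It classifies every row that appears in the unbounded result but not in the bounded one.

```python
from collections import Counter
from datetime import date


def explain_missing(unbounded, bounded, start, end):
    """Classify rows present in `unbounded` but absent from `bounded`."""
    bounded_ids = {row["id"] for row in bounded}
    reasons = Counter()
    for row in unbounded:
        if row["id"] in bounded_ids:
            continue  # returned by both requests, nothing to explain
        raw = row.get("due_date")
        if not raw:
            reasons["no due_date"] += 1
            continue
        due = date.fromisoformat(raw)
        if due > end:
            reasons["due after end_date"] += 1
        elif due < start:
            reasons["due before start_date"] += 1
        else:
            reasons["in range but missing"] += 1
    return reasons


# Invented rows: `unbounded` mimics the start_date-only request,
# `bounded` mimics the request with both start_date and end_date.
unbounded = [
    {"id": 1, "due_date": "2018-09-01"},
    {"id": 2, "due_date": "2019-12-24"},
    {"id": 3, "due_date": "2019-10-01"},
    {"id": 4, "due_date": "2019-03-15"},
]
bounded = [{"id": 1, "due_date": "2018-09-01"}]

reasons = explain_missing(unbounded, bounded, date(2018, 8, 1), date(2019, 9, 16))
print(dict(reasons))
```

If most of the missing rows land in "due after end_date", the future due_dates explain the gap. A large "in range but missing" bucket would instead point at the request or the pagination.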