Google BigQuery app_events_ and app_events_intraday_ stuck with old data - firebase

I recently started using BigQuery with Firebase and Google analytics and it was working fine few days ago, but not anymore. As I can see, there are 2 tables:
app_events_ that should contains history and
app_events_intraday_ that should be current day (realtime)
Today is 13 of July and app_events_ is "stuck" on 8 of July:
and the app_events_intraday_ is "stuck" with data of 9 of July and 10 of July:
Is this normal behavior? Has anyone experience with this problem? Is possible that is bug in the platform?

Recently Firebase changed the way they send data to BigQuery, they used to create a dataset per platform and the tables app_events and app_events_intraday will go in that dataset. Now they send the data in the dataset "analytics_ which contains events and events_intraday tables for all platforms. You may want to check this dataset.

Related

Events table is created too late in BigQuery

I am exporting my app's events from Firebase to BigQuery on a regular basis. This system creates an 'events_intraday_YYYYMMDD' table for the current day we are in, and for the previous days, the data is stored in 'events_YYYYMMDD' tables, one for every day. When the day ends, the events_intraday table is converted to an events table, and a new intraday table is created for the new day. This normally happens around early in the morning.
Recently, this process started to take too long. The new intraday tables are created when they should've been, but the older ones are not converted to events tables. For example, today (15 Apr 2021) I still have the events table for the 14th as an intraday table, which should've been saved as an events table much earlier. In the last couple of days, intraday tables are kept as long as 2-3 days.
I've looked at every documentation in Firebase, Google Analytics, and BigQuery. But could not find a solution to this, nor how to edit the settings for this if there is any. Firebase says that I should check the 'event export settings' in Google Analytics, but there is no such option there.
Anyone else ever experienced the same problem?

Firebase BigQuery data range

I've been directed to post any questions on Stack overflow from Google regarding BigQuery
I have an app linked to Firebase collecting a few custom events. This has been running since August 2019. I've linked my firebase project to Big Query (Sandbox)
I've run a query to get a breakdown of 2 events which works fine BUT only for the last 2 days. When I put in a date range from say 1st August 2019 to 1st April 2020, I get the exact same number. If I choose a date range prior to the last 2 days I get no results.
I believe that only the last 2 days worth of data is being linked across.
How do I get access to all the data I have in Firebase analytics in Big Query? Is this a Sandbox limitation?
Thanks

Yesterday's data from BigQuery

We are having some issues pulling yesterday's Google Analytics data from BigQuery. Can anyone explain at what point a previous day's GA data is finalized?
There is some explanation here of the intraday tables, but it's not very clear:
https://support.google.com/analytics/answer/3437719?hl=en
To get previous day data do you need to need to use the intraday tables at all? Do you have access to the fully processed dataset at 8am local time? Or is it 8 hours after the current day UTC+14:00 (etc)?
I had a similar question and asked their support, this is the reply:
"According to this Google Analytics documentation , it states that '1 file will be exported each day that contains the previous day’s data, and 3 files will be exported each day that contain the current day's data'. In such, the minimum time that the data from Google Analytics to be exported to BigQuery was 8 hours. Although Google Analytics can be linked to BigQuery, the availability of data depends on how it was served by Google Analytics 360."
But based on experience, it's really a minimum time. Sometimes there are delays of 4-5 hours.
My team has been pressing Google's support for providing SLA's for BigQuery dump, so they updated the documentation:
This feature is not governed by a service-level agreement (SLA).
In practice we are experiencing regular delays anywhere between 2 to 12 hours.

Google API Returning Sampled Data

I'm using the Google-Analytics API to query my analytics for data using the Google Analytics Spreadsheet Add-on We then use the spreadsheet data in Google Data studio for a dashboard to display the data.
Everything has been going well for the last few months, however over the last 48 hours we have begun to receive sampled data when we query the API using the spreadsheet add on. This is undesirable for how we are using the data.
The total results that we were getting before being returned was about 1100 results. We have altered the date range of the query to be only 3 days whereas before we were querying since the start of the year.
Initially that worked and the results were no longer sampled. Then 24 hours later the data appears to be sampled again.
The documentation says the following regarding sampling for the free account:
Analytics Standard: 500k sessions at the view level for the date range you are using
We are not using our analytics that heavily so I cannot understand why we would have hit the 500K limit?
It is also not clear to me what a "View Level" is? Any help on this would be greatly appreciated.

Last “end date” with data in Analytics

I'm using "Reporting google Analitics API" and I can’t find information about what the last “end date” with data in Analytics is.
For example, let's suppose you want to retrive the last month’s data.
When do you have to perform the query?
The first day of the current month?
...or the second one?
...or maybe the third one?
And only another question: are the returned data for days in pacific time?
Google Analytics API is supposed to have access to the same data you have in the interface.
Google says that data can take up to 24h to process. The time it takes to really update the data depends on the type and size of the account. Small accounts are updated multiple times a day and can have data available in just a few hours. Once you reach 1M hits a month you are moved to a different mode where the data on your account is updated only once a day. Google Analytics Premium customers have updates more often even for large ammounts of traffic.
There's no way to tell through the API what is exactly the time of the last hit processed. You can query the data for today by the hour and see for yourself though.
Usually you don't care and just want to make sure that the data you're querying has been fully processed for that day.
So if you query data for yesterday there's a chance it has not being completely updated, for example if it's midnight the data for yesterday is just a couple minutes ago and probably haven't been completely processed yet. The safest bet in this case is to query data for 2 days ago.
So if today is 2012-06-15 and you want to get 1 month of data a safe approach is to query data with start-date=2012-05-13 and end-date=2012-06-13. This will most of the time give you data for days that have been fully processed, but it's not 100% safe as well. Google Analytics have had outages in the past where data took longer than that to process, these are not usual though. When you get the data out it's really hard to tell just for the API if the data for those days have been fully processed or not, using the 2 days ago isea you just make it more likely that it is.
The days are aggregate following your timezone settings configured on the Google Analytics profile.

Resources