Different results between dimension Month and Date (Year Month) - google-data-studio

I am trying to generate a list of users but I get different values depending on which dimension I use.
The data source is a GA4 property, but it also occurs with UA.
This is my example: https://datastudio.google.com/reporting/f75c1478-e738-45ad-8a23-97c51b69c374
For the attached table, if I use the "Month" dimension, it returns a different value that I use the dimension "Date (Year Month).
Could someone explain me the difference and how I could fix it?
Thank you very much in advance!
Date (Year Month)
Users
New users
1
Aug 2022
70,469
52,103
2
Jul 2022
71,939
54,837
3
Jun 2022
75,868
57,613
4
May 2022
85,273
67,462
5
Apr 2022
82,099
63,977
Month
Users
New users
1
Aug 2022
58,674
52,025
2
Jul 2022
60,150
54,748
3
Jun 2022
62,643
57,498
4
May 2022
73,172
67,398
5
Apr 2022
69,164
63,893

Related

Calculating running (cumulative) sum with start and end dates over time

I'm looking for a way to calculate recurring revenue over time in a time plot where the X-axis is the date and Y-axis is the total amount of revenue (sum). The question we're trying to answer is "At any given point in time, what is the total amount of revenue coming in?"
Every line in the source data has an account name, subscription type (new, renewal, expansion), annual subscription amount, service start date, and service end date.
[Sample of data below]
Account
Type
Subscription Amount
Service Start
Service End
001
New
$ 1000
Jan 1, 2020
Dec 31, 2021
002
New
$ 2000
Feb 1, 2020
Jan 31, 2021
003
New
$ 3000
Apr 1, 2020
Mar 1, 2022
001
Expansion
$ 500
May 1, 2020
Dec 31, 2021
002
Renewal
$ 2500
Feb 1, 2021
Jan 31, 2023
004
New
$ 1000
Mar 1, 2021
Feb 28, 2023
002
Expansion
$ 250
Jun 1, 2021
Jan 31, 2023
001
Renewal
$ 1500
Jan 1, 2022
Dec 31, 2023
The main issue I'm running into is that when using either running sum or "cumulative" under the time plot settings panel, the total will continue running on the X-axis regardless of the service end date. This is a problem for situations like Account 002, which renews Feb 1st, 2021. Data Studio would (incorrectly) show an amount of $4500 for that account starting on Feb 1st -- where I want it to show only $2500 at that time.
Comparing what the data should be showing at any point in time vs. what actually happens:
Date
Amount I want to see
What is currently calculated
Jan 1, 2020
$ 1000
$ 1000
Feb 1, 2020
$ 3000
$ 3000
Apr 1, 2020
$ 6000
$ 6000
May 1, 2020
$ 6500
$ 6500
Feb 1, 2021
$ 7000
$ 9000
Mar 1, 2021
$ 8000
$ 10,000
Jun 1, 2021
$ 8250
$ 10,250
Jan 1, 2022
$ 8250
$ 11,750
Jan 1, 2023
$ 5250
$ 11,750
Is there a way to achieve this?
Ideally, the most elegant solution seems to be to tell the plot to only sum between dates (i.e. sum if "reported x-axis date" is between Service Start date and Service End date), but I don't know how to reference the X-axis value in a formula.
I have not yet tried blending the data but I'm thinking that may be another path forward as well?
Any help on this would be much appreciated!

How can I get % Growth from Previous Column in Google Data Studio?

I have this data set (Google Sheets):
category
February 2022
March 2022
April 2022
Gold
1457
1627
1890
Silver
1962
2415
3036
Bronze
5683
7520
10169
I want to show the % growth from each month (which is basically growth from the previous column).
For example:
Under Gold, March_2022 growth is:
(1627 - 1457)/1457 = 11.67%.
Expected outcome might be like this (Google Sheet):
category
February 2022
March 2022
April 2022
Gold
-
11.67%
16.16%
Silver
-
23.09%
25.71%
Bronze
-
32.32%
35.23%
How can I do that in Google Data Studio?
I have tried comparison calculation: percent difference from total (for both corresponding and base data) but it doesn't work:
My last option might be to hard code it in query, but I prefer to do that in Google Data Studio.
Google Data Studio report
The calculated fields below could be used (#1 ensures that values are displayed as NULL while #2 and #3 compare values between each month), setting the Type to Numeric > Percent; to display NULL values as -, select the respective symbol from the Missing Values section in the Style tab of the table:
1) February 2022
NULLIF(February 2022 / February 2022, 1)
2) March 2022
(March 2022 / February 2022) - 1
3) April 2022
(April 2022 / March 2022) - 1
Publicly editable Google Data Studio report (embedded Google Sheets data source) and a GIF to elaborate:

I have facing issue with date in angularjs, when I use central timezone(-6,-7,-8) getting issue with date

I have this date in startdate=2021-10-27T00:00:00-04:00,
d=new Date(data.StartDate) // outputTue Oct 26 2021 23:00:00 GMT-0500
But date is getting one day before'Tue Oct 26 2021 23:00:00 GMT-0500' in central timezone(below -6,-7,-8...),
How to resolve this date issue(26) in central timezone and for all timezones it is getting well. But only -6,-7... timezone have an issue

How to deal with the categorical varaible that is not in training set (new categories in test set)?

I want to create a categorical variable for the semester column in my dataset. I have other additional variables with the target-not shown in the table.
Training set: include 2016-2017
Test set or validation set: include only 2018
My Concern is when I make the predictive model I will have categorical variables (factors) that do not exist in the training set (i.e SPRING 2018, SUMMER 2018–First SESSION,...etc). Is this will be a problem theoretically? How to deal with that?
Start End Semester
Jan 19,2016 May 6,2016 SPRING 2016
May 16,2016 Jun 25,2016 SUMMER 2016-FIRST SESSION
Jun 27,2016 Aug 6,2016 SUMMER 2016-SECOND SESSION
Aug 24,2016 Dec 16,2016 FALL 2016
Jan 17,2017 May 5,2017 SPRING 2017
May 15,2017 Jun 24,2017 SUMMER 2017–First SESSION
Jun 26,2017 Aug 5,2017 SUMMER 2017-SECOND SESSION
Aug 23,2017 Dec 15,2017 FALL 2017
Jan 16,2018 May 4,2018 SPRING 2018
May 14,2018 June 23,2018 SUMMER 2018–First SESSION
Jun 25,2018 Aug 4,2018 SUMMER 2018-SECOND SESSION
Aug 22,2018 Dec 14,2018 Fall 2018
The machine learning algorithms learn patterns in data, if we do not have any repeated pattern then with high probability they failed to provide adequate answer. I think you need to transform adequate information to your model for getting a rational output. Regarding to your research question, it can be different:
For instance, If you want to answer the question of when is the starting and ending time of semester x in year y?
You can convert the semester column into 4 ordinal categorical variables of 1 to 4 for Spring to Fall. In addition you should provide a year column in your data and DD,MM for ending and starting time.

SSRS matrix group by error

I have an ssrs matrix with total in rows and months as columns .. something like this .
Jan Feb March
Total1
Total2
I can choose a month as a parameter and my matrix shows the results for the month and the two months before the current month.
When I choose August :
I can see June July August
When I choose September :
I can see July August September
But when I choose October
I see October August September
My client wants to see it as August September October
I don't know the reason for this behavior .But can anyone help me with this ??
I tried doing the sort for the tablix as follows :
=(Fields!Month.Value)
Month is a number like 1 for Jan, 2 for Feb. I add it as a sorting expression but I dont see any change in the sorting.
according to your data.
August = "8"
September "9"
October = "10"
it looks to me like you need to convert your month to an integer type, Sort is picking up your month as a string value.
try converting your month field to an integer in your sort expression and then running rerunning the report.
something like:
=CInt(Fields!Month.Value)
I finally found what was going wrong with my solution .
I was adding the sorting on the "tablix properties" section .
Sorting needs to be added on the "Column group" and not on the "tablix properties" and this did the trick in my case. :-)

Resources