Why do two charts (Table and Bar Chart) display different data? - looker-studio

I have a strange issue in Google Data Studio where stacked column/bar charts are not displaying data correctly. I'm not sure if it's the way I'm setting it up or a bug. I've linked the Google Sheet and data page below.
Data Set (Google Sheets):
Date       Roll  Roller
---------  ----  ------
1/4/2022   12    Oscar
1/5/2022   11    Oscar
1/6/2022   10    Oscar
1/7/2022   3     Oscar
1/8/2022   1     Oscar
1/9/2022   5     Oscar
1/10/2022  18    Dusty
1/11/2022  14    Oscar
1/12/2022  14    Oscar
1/13/2022  14    Oscar
1/14/2022  3     Dusty
1/15/2022  2     Dusty
1/16/2022  14    Oscar
1/17/2022  14    Chris
1/18/2022  8     Oscar
1/19/2022  9     Oscar
1/20/2022  14    Oscar
1/21/2022  7     Chris
1/22/2022  10    Alex
Google Data Studio Report
As seen, "Chris" has two Roll inputs (14 and 7) in the table, but the graph shows only one:

In a Bar Chart, the default number of "Series" (unique values in the Breakdown Dimension field, in this case, the field titled Roll) is set to 10 and can be changed from the Style tab (with the current maximum of 20, which will be sufficient in this scenario as the Roll field has a COUNT_DISTINCT value of 12).
Editable Google Data Studio Report (Embedded Google Sheets Data Source) and a GIF to elaborate:

This happens because the bar chart is using the default number of series while the number of series in your data is higher than that default, so only the top series are shown. You can see more detail here
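To check the series limit against the sample data outside the report, here is a minimal Python sketch. It only counts the distinct Roll values and compares them with the default (10) and maximum (20) limits quoted above; the names `rolls` and `hidden_series` are illustrative, and the sketch makes no claim about which series Data Studio keeps when it truncates.

```python
# Roll values copied from the sample sheet, in date order.
rolls = [12, 11, 10, 3, 1, 5, 18, 14, 14, 14, 3, 2, 14, 14, 8, 9, 14, 7, 10]

DEFAULT_SERIES_LIMIT = 10  # default quoted in the answer above
MAX_SERIES_LIMIT = 20      # maximum quoted in the answer above


def hidden_series(values, limit):
    """Return how many distinct breakdown values cannot be drawn under the limit."""
    distinct = len(set(values))
    return max(0, distinct - limit)


print(len(set(rolls)))                             # distinct Roll values
print(hidden_series(rolls, DEFAULT_SERIES_LIMIT))  # series dropped at the default
print(hidden_series(rolls, MAX_SERIES_LIMIT))      # series dropped at the maximum
```

With the default limit some series are dropped, which is consistent with the missing bar segment; raising the limit to the maximum leaves none hidden for this data set.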

Related

How can I create an option to filter out NULL values in a field?

I have used a Google Sheet to create a data source for a dashboard in Google Data Studio:
Account_Number  Process_Date  Business_Unit  Budget_Reference  Vender_Name
--------------  ------------  -------------  ----------------  ----------------------
1000001         7/1/2017      113111         0                 ABCD Plumbing
1000002         7/9/2017      114122         0                 ACME-1 Electric
1000003         6/14/2017     114223         1
1000004         5/11/2017     112444         1                 Shark Industries
1000005         5/12/2017     113334         2                 Cyberdyne Systems
1000006         5/11/2017     114440         2                 Ollivander's Wand Shop
1000007         5/9/2017      120001         2
1000008         5/17/2017     120009         2                 Wayne Enterprises
1000009         4/4/2017      120005         3                 Fun City - USA
1000010         4/15/2017     120014         3
1000011         3/11/2017     120111         3
I used it to build a table and now want to use an advanced filter control to filter out rows containing blank vendor names:
Account_Number  Process_Date  Business_Unit  Budget_Reference  Vender_Name
--------------  ------------  -------------  ----------------  ----------------------
1000001         7/1/2017      113111         0                 ABCD Plumbing
1000002         7/9/2017      114122         0                 ACME-1 Electric
1000004         5/11/2017     112444         1                 Shark Industries
1000005         5/12/2017     113334         2                 Cyberdyne Systems
1000006         5/11/2017     114440         2                 Ollivander's Wand Shop
1000008         5/17/2017     120009         2                 Wayne Enterprises
1000009         4/4/2017      120005         3                 Fun City - USA
Google Data Studio Report:
Use either approach (1 or 2), based on the requirement:
Flexible - Checkbox: Provides the option for users to switch NULL values on or off
Fixed - Filter: Use if the aim is to hide NULL values
1) Flexible: Control Box
To create a toggleable feature, a checkbox control can be used, with the control field:
IF(Vendor_Name IS NULL, FALSE, TRUE)
The calculated field above uses the IF function to detect NULL values in the Vendor_Name field, which then filters data based on the checkbox selection:
⊟: (Default) all data
☑: Data excluding NULL values
▢: Only NULL values
To view all data, click the ↶ Reset button on the report header.
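The three checkbox states can be mimicked outside the report with a small Python sketch. This is only an illustration of the logic: it assumes blank sheet cells arrive as NULL (modelled here as `None`), and the names `has_vendor` and `apply_checkbox` are hypothetical, not part of Data Studio.

```python
# (Account_Number, Vendor_Name) pairs from the sample sheet; None models a blank cell.
rows = [
    (1000001, "ABCD Plumbing"),
    (1000002, "ACME-1 Electric"),
    (1000003, None),
    (1000004, "Shark Industries"),
    (1000005, "Cyberdyne Systems"),
    (1000006, "Ollivander's Wand Shop"),
    (1000007, None),
    (1000008, "Wayne Enterprises"),
    (1000009, "Fun City - USA"),
    (1000010, None),
    (1000011, None),
]


def has_vendor(vendor_name):
    """Mirror of IF(Vendor_Name IS NULL, FALSE, TRUE)."""
    return vendor_name is not None


def apply_checkbox(data, state):
    """state=None: no selection (all data); True: checked; False: unchecked."""
    if state is None:
        return list(data)
    return [row for row in data if has_vendor(row[1]) == state]


print(len(apply_checkbox(rows, None)))   # all rows
print(len(apply_checkbox(rows, True)))   # rows with a vendor name
print(len(apply_checkbox(rows, False)))  # rows with a blank vendor name
```

The checked state yields the same seven accounts as the desired table in the question.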
Optionally, use report links to ensure that users start with a default view:
NULL values excluded by default
Only NULL values shown by default
Editable Google Data Studio Report (Embedded Google Sheets Data Source) and a GIF to elaborate:
2) Fixed: Filter
To hide NULL values from users (without the ability for users to toggle), this filter would do:
Exclude Vendor_Name is NULL
Editable Google Data Studio Report (Embedded Google Sheets Data Source) and a GIF to demonstrate:

Excel: Change the color of the cell based on a threshold reached and matching to a string

Activity       Employee  Week of May 17  Week of May 24
-------------  --------  --------------  --------------
Inbox          Alice     3               4
Inbox          Jane      5               8
Alpha Project  Alice     10              3
Beta Project   Francis   7               5
Chi Project    Jane      4               3
I've attempted to use conditional formatting, arrays, and VLOOKUPs but have been unable to cleanly get the following end result.
The end result is to flag anybody working > 10 hours in a given week.
Table is above.
End result should change the color of a cell titled "Alice" outside of this table because Alice worked 13 hours during Week of May 17.
End result should change the color of a cell titled "Jane" outside of this table because Jane worked 11 hours during Week of May 24.
Francis worked 10 hours or below, so no action is needed.
Any help on this is much appreciated --
Create a condition with the following formula:
=SUMIF(B2:B6,F2,C2:C6)>10
Where B2:B6 is the column of Employee names, F2 is the cell you want coloured, and C2:C6 is the column of the May 17th week
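To sanity-check what the SUMIF condition should return for the sample table, the same per-employee total can be sketched in Python. The helper names `sumif` and `flagged` are illustrative only; the threshold of 10 comes from the question.

```python
# (Activity, Employee, Week of May 17, Week of May 24) from the table above.
hours = [
    ("Inbox", "Alice", 3, 4),
    ("Inbox", "Jane", 5, 8),
    ("Alpha Project", "Alice", 10, 3),
    ("Beta Project", "Francis", 7, 5),
    ("Chi Project", "Jane", 4, 3),
]


def sumif(rows, employee, week_index):
    """Equivalent of SUMIF(B2:B6, employee, <week column>)."""
    return sum(row[week_index] for row in rows if row[1] == employee)


def flagged(rows, week_index, threshold=10):
    """Employees whose total for the given week exceeds the threshold."""
    names = sorted({row[1] for row in rows})
    return [name for name in names if sumif(rows, name, week_index) > threshold]


print(flagged(hours, 2))  # Week of May 17
print(flagged(hours, 3))  # Week of May 24
```

Index 2 corresponds to column C in the formula above; switching to index 3 is the equivalent of pointing the sum range at the May 24 column.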

Highlight the dominating number in Excel, most repeated for each keyword

Is this possible using excel formulas? To find keyword and number then match and color the highest number for that specific keyword, e.g. below:
This is the list, with keywords in column A and numbers in column B:
shoes 9
shoes 5
shoes 3
furniture 2
furniture 4
furniture 5
beauty 6
beauty 8
health 35
health 4
health 2
grocery 3
grocery 2
computers 9
computers 7
laptop 2
laptop 11
laptop 2
laptop 6
pets 9
pets 3
books 5
books 5
shoes 9 Highlight this number
shoes 5
shoes 3
furniture 2
furniture 4
furniture 5 Highlight this number
beauty 6
beauty 8 Highlight this number
health 35 Highlight this number
health 4
health 2
grocery 3 Highlight this number
grocery 2
computers 9 Highlight this number
computers 7
laptop 2
laptop 11 Highlight this number
laptop 2
laptop 6
pets 9 Highlight this number
pets 3
books 5 ignore if its equal
books 5
You can use conditional formatting, choosing "Use a formula..." and use a formula such as =b1=maxifs($B$1:$B$100,$A$1:$A$100,a1). Be mindful of absolute vs. relative reference to ensure that you're tracking the right ranges.
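For reference, here is a small Python sketch of the same per-keyword maximum test. Note that a plain "value equals the group maximum" rule is true for both books rows, since both equal 5; the `skip_ties` flag below is an addition to match the "ignore if its equal" requirement in the question, and the function name is hypothetical.

```python
from collections import defaultdict

# (keyword, number) pairs in the order given in the question.
data = [
    ("shoes", 9), ("shoes", 5), ("shoes", 3),
    ("furniture", 2), ("furniture", 4), ("furniture", 5),
    ("beauty", 6), ("beauty", 8),
    ("health", 35), ("health", 4), ("health", 2),
    ("grocery", 3), ("grocery", 2),
    ("computers", 9), ("computers", 7),
    ("laptop", 2), ("laptop", 11), ("laptop", 2), ("laptop", 6),
    ("pets", 9), ("pets", 3),
    ("books", 5), ("books", 5),
]


def rows_to_highlight(rows, skip_ties=True):
    """Return one boolean per row: True where the row holds its keyword's maximum."""
    by_keyword = defaultdict(list)
    for keyword, number in rows:
        by_keyword[keyword].append(number)

    flags = []
    for keyword, number in rows:
        top = max(by_keyword[keyword])
        is_unique_top = by_keyword[keyword].count(top) == 1
        flags.append(number == top and (is_unique_top or not skip_ties))
    return flags


highlighted = [row for row, flag in zip(data, rows_to_highlight(data)) if flag]
print(highlighted)
```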
In particular, when a question is tagged vba you should show what you have tried. The macros usage guide specifically states "DO NOT USE for VBA / MS-Office languages" and the excel wiki states "Questions tagged with excel should be version-agnostic." However, this is possible with a formula in versions earlier than those with MAXIFS (i.e. not: Excel for Office 365, Excel for Office 365 for Mac, Excel 2016, Excel 2016 for Mac, Excel Online, Excel for iPad, Excel for iPhone, Excel for Android tablets, Excel for Android phones, Excel Mobile), if in a more long-winded way:
Assuming you have 11 in B18, add a column (say I), populate I1 with 0, and fill enough of it from I2 downwards with:
=IF(A1<>A2,I1+1,I1)
Copy that down, then sort your data on ColumnI Smallest to Largest, then by ColumnB Largest to Smallest (to preserve the order of the values in ColumnA).
Then select B2 down to as far as required, clear any existing CF rules from it and choose HOME > Styles - Conditional Formatting, New Rule..., Use a formula to determine which cells to format, and under Format values where this formula is true: enter
=AND(A1<>A2,B2<>B3)
Format..., select choice of formatting, OK.
The above should not, as specified, highlight the values for books though if working I suspect #nutsch's current answer might.
Sorry, I forgot to adjust my guess for what was where, once I realised a header row would make things easier.
This does, though, still have a problem: text that changes from one row to the next but shares the same quantity will not trigger highlighting, so a more complex formula may be required.
Based on #pnuts' idea, I found a simpler way to do it.
Sort column B Z to A, then sort column A A to Z, expanding the selection for both.
Next, write a formula to highlight duplicates in column A, excluding the first one, and drag the formula down; it highlights all the correct ones.
thank you

Sequences in Graph Database

All,
I am new to the graph database area and want to know if this type of example is applicable to a graph database.
Say I am looking at a baseball game. When each player goes to bat, there are 3 possible outcomes: hit, strikeout, or walk.
For each batter and throughout the baseball season, what I want to figure out is the counts of the sequences.
For example, for batters that went to the plate n times, how many people had a particular sequence (e.g. hit/walk/strikeout or hit/hit/hit/hit), and if so, how many of the same batters repeated the same sequence indexed by time. To further explain, time would allow me to know if a particular sequence (e.g. hit/walk/strikeout or hit/hit/hit/hit) occurred at the beginning of the season, in the middle, or in the later half.
For a key-value type database, the raw data would look as follows:
Batter   Time   Game  Event      Bat
-------  -----  ----  ---------  ---
Charles  April  1     Hit        1
Charles  April  1     strikeout  2
Charles  April  1     Walk       3
Doug     April  1     Walk       1
Doug     April  1     Hit        2
Doug     April  1     strikeout  3
Charles  April  2     strikeout  1
Charles  April  2     strikeout  2
Doug     May    5     Hit        1
Doug     May    5     Hit        2
Doug     May    5     Hit        3
Doug     May    5     Hit        4
Hence, my output would appear as follows:
Sequence                 Freq  Unique Batters  Time
-----------------------  ----  --------------  -----
hit                      5000  600             April
walk/strikeout           3000  350             April
strikeout/strikeout/hit  2000  175             April
hit/hit/hit/hit/hit      1000  80              April
hit                      6000  800             May
walk/strikeout           3500  425             May
strikeout/strikeout/hit  2750  225             May
hit/hit/hit/hit/hit      1250  120             May
.                        .     .               .
.                        .     .               .
.                        .     .               .
.                        .     .               .
If this is feasible for a graph database, would it also scale? What if instead of 3 possible outcomes for a batter, there were 10,000 potential outcomes with 10,000,000 batters?
More so, the 10,000 unique outcomes would be sequenced in a combinatoric setting (e.g. 10,000 CHOOSE 2, 10,000 CHOOSE 3, etc.).
My question then is, if a graphing database is appropriate, how would you propose setting up a solution?
Much thanks in advance.
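Independent of the storage engine, the counting described in the question can be sketched in plain Python to make the target output concrete. This is only one reading of the requirement: it assumes a "sequence" is any contiguous run of events within one batter's game, it lowercases the event names, and `sequence_counts` is a hypothetical helper rather than a graph-database query.

```python
from collections import defaultdict

# (batter, month, game, event, at-bat number), from the sample rows in the question.
rows = [
    ("Charles", "April", 1, "hit", 1),
    ("Charles", "April", 1, "strikeout", 2),
    ("Charles", "April", 1, "walk", 3),
    ("Doug", "April", 1, "walk", 1),
    ("Doug", "April", 1, "hit", 2),
    ("Doug", "April", 1, "strikeout", 3),
    ("Charles", "April", 2, "strikeout", 1),
    ("Charles", "April", 2, "strikeout", 2),
    ("Doug", "May", 5, "hit", 1),
    ("Doug", "May", 5, "hit", 2),
    ("Doug", "May", 5, "hit", 3),
    ("Doug", "May", 5, "hit", 4),
]


def sequence_counts(records):
    """Map (month, sequence) -> (frequency, number of unique batters)."""
    # Collect each batter's events per game, keyed so at-bats can be ordered.
    games = defaultdict(list)
    for batter, month, game, event, bat in records:
        games[(batter, month, game)].append((bat, event))

    freq = defaultdict(int)
    batters = defaultdict(set)
    for (batter, month, _game), events in games.items():
        ordered = [event for _bat, event in sorted(events)]
        # Enumerate every contiguous run of events within the game.
        for start in range(len(ordered)):
            for end in range(start + 1, len(ordered) + 1):
                key = (month, "/".join(ordered[start:end]))
                freq[key] += 1
                batters[key].add(batter)
    return {key: (freq[key], len(batters[key])) for key in freq}


counts = sequence_counts(rows)
print(counts[("April", "hit")])
print(counts[("May", "hit/hit/hit/hit")])
```

Enumerating every contiguous run grows quadratically with the number of at-bats per game, which is one reason the scaling concern raised in the question matters.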

Microsoft SSAS Average Calculation in Cube

I'm very new to cube development in SSAS. I'm using Microsoft BIDS 2008.
I have built a small cube, which is as mentioned below:
     India  Pakistan  GrandTotal
Apr  6      10        16
May  5      6         11
I want to add a field called Average beside the grand total:
     India  Pakistan  GrandTotal  Average
Apr  6      10        16          8
May  5      6         11          5
Any inputs on this would be helpful. 5.5 in average is truncated to 5.
Thanks !!!
Create a calculated member that divides current measure by the count of members in your measure group (normally a count measure is automatically created when you add a measure group).
Truncation can be handled by the FORMAT_STRING property of that calculation or by using MDX functions.
More info on calculated members:
http://technet.microsoft.com/en-us/library/ms166568(v=sql.105).aspx
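The arithmetic behind the truncation can be illustrated with a short Python sketch. It is not MDX and does not model SSAS itself; it only shows that dividing the grand total by the member count gives 5.5 for May, and that the 5 in the question is what integer handling of that result produces. The name `summarize` is hypothetical.

```python
# Cell values from the small cube in the question.
cube = {
    "Apr": {"India": 6, "Pakistan": 10},
    "May": {"India": 5, "Pakistan": 6},
}


def summarize(values):
    """Return (grand total, true average, truncated average) for one row."""
    total = sum(values.values())
    count = len(values)
    average = total / count     # true division keeps the fractional part
    truncated = total // count  # integer division drops it
    return total, average, truncated


for month, values in cube.items():
    print(month, summarize(values))
```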
