KelvinMorel
Helper II

Nested duplicated MLB Players list

Hi,

I’m facing a tricky “duplicates” challenge and I’m out of options. I have a list of MLB player stats going back to the early 90s. Each player appears once per year for as long as they are active, of course. I would like to create a list of unique MLB players to use as a dimension table.

 

 I've extracted a list of name, handedness and position attributes. There are 2 issues, and the first can’t be solved because of the second.

  1. Players can play multiple positions, e.g. Adam Dunn played Outfield [OF], Infield [IF] and Designated Hitter [DH].
    I could remove duplicates based on [FullName] + [Handedness] but…
     
  2. Multiple players can also share the same name, e.g. Adam Eaton (Pitcher) and Adam Eaton (OF),
    and players can even share the same name and handedness, e.g. Will Smith (Pitcher) and Will Smith (Catcher).


     

Sample of this list

 

Yeah I know… Other options are welcome!

 

Thx,

2 REPLIES
KelvinMorel
Helper II

Hi @HotChilli,

 

Thx, I like the "Primary position" suggestion. Adding an additional piece of info for each player isn't an option.

 

Grtz,

 

HotChilli
Super User

For Item 2, you need an additional piece of info to make each player unique, e.g. birthdate, middle name, height or something. Hopefully you can add it to your data early in the process and without doing it manually, but if manual entry is the only way, then that's what you have to do.
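
To illustrate why the extra attribute matters, here is a minimal Python sketch (plain standard library, not Power Query). The `birth_date` field and every value in the rows are placeholders invented for the example, not real player data; the point is only that [FullName] + [Handedness] collapses two different players into one, while a key that includes the extra attribute keeps them apart.

```python
# Hypothetical season rows. birth_date is the assumed "additional piece of info";
# all handedness and date values are placeholders, not real stats.
rows = [
    {"full_name": "Will Smith", "handedness": "R", "position": "P", "birth_date": "1900-01-01"},
    {"full_name": "Will Smith", "handedness": "R", "position": "C", "birth_date": "1900-02-02"},
    {"full_name": "Will Smith", "handedness": "R", "position": "C", "birth_date": "1900-02-02"},
]


def player_key(row):
    """Composite key: the name plus the disambiguating attribute."""
    return (row["full_name"], row["birth_date"])


# Name + handedness alone cannot tell the two players apart.
unique_by_name = {(r["full_name"], r["handedness"]) for r in rows}

# Adding the extra attribute yields one key per real player.
unique_by_key = {player_key(r) for r in rows}

print(len(unique_by_name), len(unique_by_key))
```

Any attribute would do in place of `birth_date`, as long as it is stable across seasons and differs between the players who share a name.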

 

Item 1 isn't really a problem; it just depends on what you are using the data for. It works as a fact table if you want to keep track of the different positions players have played. If you want to use it as a dimension table, then use a primary position field, or get rid of the positions and remove duplicates.
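
One possible reading of the "primary position" idea, sketched in plain Python rather than Power Query: count how often each player appears at each position and keep the most frequent one, giving one row per player. The definition of "primary" as "most frequent", the function name `build_player_dimension`, and the sample rows are all assumptions made for the example. It also assumes (name, handedness) identifies a player, which Item 2 shows is not always true, so a real key would need the extra attribute as well.

```python
from collections import Counter, defaultdict

# Hypothetical season rows: (full_name, handedness, position).
# Handedness values and row counts are placeholders, not real stats.
season_rows = [
    ("Adam Dunn", "L", "OF"),
    ("Adam Dunn", "L", "OF"),
    ("Adam Dunn", "L", "IF"),
    ("Adam Dunn", "L", "DH"),
    ("Sample Player", "R", "P"),
]


def build_player_dimension(rows):
    """Return one row per player, with the most frequent position as primary."""
    position_counts = defaultdict(Counter)
    for name, hand, position in rows:
        position_counts[(name, hand)][position] += 1

    return [
        {
            "full_name": name,
            "handedness": hand,
            # most_common(1) gives [(position, count)] for the top position
            "primary_position": counts.most_common(1)[0][0],
        }
        for (name, hand), counts in position_counts.items()
    ]


dimension = build_player_dimension(season_rows)
for player in dimension:
    print(player)
```

The original season rows can stay as the fact table with every position intact, while the dimension carries a single primary position per player.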
