Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Join the Fabric FabCon Global Hackathon—running virtually through Nov 3. Open to all skill levels. $10,000 in prizes! Register now.

Reply
kellyylx
Helper I
Helper I

keep only characters and numbers in column

Hi i am working with data in Korean and would like to remove punctuations while keeping the characters and numbers in a column. It would be great if i can do this in one step rather than removing each type of punctuation one by one. 

 

i tried using this but realise it doesnt work for languages that does not use the alphabet (Korean/Japanese)

=Text.Select([Column1],{"a".."z","A".."Z","0".."9"})

 

Is there another way to do so?

1 ACCEPTED SOLUTION
OwenAuger
Super User
Super User

Hi @kellyylx 

I would recommend creating a separate list query containing all characters you want to keep.

You would need to identify all ranges of Unicode characters you want to keep, which may require a little research.

 

For example, here is a list containing (probably) all Korean characters plus the alphabet and numerals. You can add/remove whatever is required:

// CharactersToKeep
{
  Character.FromNumber(0x1100)..Character.FromNumber(0x11FF), // Hangul Jamo
  Character.FromNumber(0xa960)..Character.FromNumber(0xA97F), // Extended Hangul Jamo 1
  Character.FromNumber(0xD7B0)..Character.FromNumber(0xD7FF), // Extended Hangul Jamo 2
  Character.FromNumber(0xAC00)..Character.FromNumber(0xD7A3), // Hangul Syllables
  Character.FromNumber(0x3130)..Character.FromNumber(0x318F),  // Hangul Compatibility Jamo
  "0".."9", // Numerals
  "a".."z", // Alphabet lowercase
  "A".."Z" // Alphabet uppercase
}

Then reference this list as the 2nd argument of Text.Select:

Text.Select([Original Text],CharactersToKeep)

Small example attached.

 

Regards


Owen Auger
Did I answer your question? Mark my post as a solution!
Blog
LinkedIn

View solution in original post

1 REPLY 1
OwenAuger
Super User
Super User

Hi @kellyylx 

I would recommend creating a separate list query containing all characters you want to keep.

You would need to identify all ranges of Unicode characters you want to keep, which may require a little research.

 

For example, here is a list containing (probably) all Korean characters plus the alphabet and numerals. You can add/remove whatever is required:

// CharactersToKeep
{
  Character.FromNumber(0x1100)..Character.FromNumber(0x11FF), // Hangul Jamo
  Character.FromNumber(0xa960)..Character.FromNumber(0xA97F), // Extended Hangul Jamo 1
  Character.FromNumber(0xD7B0)..Character.FromNumber(0xD7FF), // Extended Hangul Jamo 2
  Character.FromNumber(0xAC00)..Character.FromNumber(0xD7A3), // Hangul Syllables
  Character.FromNumber(0x3130)..Character.FromNumber(0x318F),  // Hangul Compatibility Jamo
  "0".."9", // Numerals
  "a".."z", // Alphabet lowercase
  "A".."Z" // Alphabet uppercase
}

Then reference this list as the 2nd argument of Text.Select:

Text.Select([Original Text],CharactersToKeep)

Small example attached.

 

Regards


Owen Auger
Did I answer your question? Mark my post as a solution!
Blog
LinkedIn

Helpful resources

Announcements
September Power BI Update Carousel

Power BI Monthly Update - September 2025

Check out the September 2025 Power BI update to learn about new features.

FabCon Atlanta 2026 carousel

FabCon Atlanta 2026

Join us at FabCon Atlanta, March 16-20, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.

Top Solution Authors
Top Kudoed Authors