CrystalLang-Regex01

CrystalLang-Regex01

·

3 min read

A regular expression is a sequence of characters that define a search pattern, mainly for use in pattern matching with strings. Ruby regular expressions i.e. Crysal regex for short, helps us to find particular patterns inside a string. Two uses of Crysal regex are Validation and Parsing. Crysal regex can be used to validate an email address and an IP address too. Crysal regex expressions are declared between two forward slashes.

Syntax:

# finding the word 'hi'
"Hi there, i am using gfg" =~ /hi/

This will return the index of first occurrence of the word ‘hi’ if present, or else will return ‘ nil ‘. Checking if a string has a regex or not

We can also check if a string has a regex or not by using the match method. Below is the example to understand.

# Crysal program of regular expression 

# Checking if the word is present in the string 
if "hi there".match(/hi/) 
    puts "match"
end

Output:

match

Checking if a string has some set of characters or not

We can use a character class which lets us define a range of characters for the match. For example, if we want to search for vowel, we can use [aeiou] for match.

# Crysal program of regular expression 

# declaring a function which checks for vowel in a string 
def contains_vowel(str) 
str =~ /[aeiou]/ 
end

# Driver code 

# Geeks has vowel at index 1, so function returns 1 
puts( contains_vowel("Geeks") ) 

# bcd has no vowel, so return nil and nothing is printed 
puts( contains_vowel("bcd") )

Output:

1

There are different short expressions for specifying character ranges :

\w is equivalent to [0-9a-zA-Z] \d is the same as [0-9] \s matches white space \W anything that’s not in [0-9a-zA-Z] \D anything that’s not a number \S anything that’s not a space The dot character . matches all but does not match new line. If you want to search . character, then you have to escape it.

# Crysal program of regular expression 

a="2m3"
b="2.5"
# . literal matches for all character 
if(a.match(/\d.\d/)) 
    puts("match found") 
else
    puts("not found") 
end
# after escaping it, it matches with only '.' literal 
if(a.match(/\d\.\d/)) 
    puts("match found") 
else
    puts("not found") 
end

if(b.match(/\d.\d/)) 
    puts("match found") 
else
    puts("not found") 
end

Output:

match found not found match found

or matching multiple characters we can use modifiers:

  • is for 1 or more characters
  • is for 0 or more characters ? is for 0 or 1 character {x, y} if for number of characters between x and y i is for ignores case when matching text. x is for ignores white space and allows comments in regular expressions. m is for matches multiple lines, recognizing newlines as normal characters. u,e,s,n are for interprets the regexp as Unicode (UTF-8), EUC, SJIS, or ASCII. the regular expression is assumed to use the source encoding, If none of these modifiers is specified.

source : geeksforgeeks.org/ruby-regular-expressions/..

code is tested on:play.crystal-lang.org