Diff for "BashFAQ/054"

Differences between revisions 1 and 31 (spanning 30 versions)

How can I tell whether a variable contains a valid number?

First, you have to define what you mean by "number". The most common case when people ask this seems to be "a non-negative integer, with no leading + sign". Or in other words, a string of all digits. Other times, people want to validate a floating-point input, with optional sign and optional decimal point.

Hand parsing

If you're validating a simple "string of digits", you can do it with a glob:

# Bash
if [[ $foo != *[!0-9]* ]]; then
    echo "'$foo' is strictly numeric"
else
    echo "'$foo' has a non-digit somewhere in it"
fi

The same thing can be done in Korn and POSIX shells as well, using case:

# ksh, POSIX
case "$foo" in
    *[!0-9]*) echo "'$foo' has a non-digit somewhere in it" ;;
    *) echo "'$foo' is strictly numeric" ;;
esac

If you need to allow a leading negative sign, or if want a valid floating-point number or something else more complex, then there are a few possible ways. Standard globs aren't expressive enough to do this, but we can use extended globs:

# Bash -- extended globs must be enabled.
# Check whether the variable is all digits.
shopt -s extglob
[[ $var == +([0-9]) ]]

A more complex case:

# Bash
shopt -s extglob
[[ $foo = *[0-9]* && $foo = ?([+-])*([0-9])?(.*([0-9])) ]] &&
  echo "foo is a floating-point number"

The leading test of $foo is to ensure that it contains at least one digit. The extended glob, by itself, would match the empty string, or a lone + or -, which may not be desirable behavior.

Korn shell has extended globs enabled by default, but lacks [[, so we must use case to do the glob-matching:

# Korn
case $foo in
  *[0-9]*)
    case $foo in
        ?([+-])*([0-9])?(.*([0-9]))) echo "foo is a number";;
    esac;;
esac

Note that this uses the same extended glob as the Bash example before it; the third closing parenthesis at the end of it is actually part of the case syntax.

If your definition of "a valid number" is even more complex, or if you need a solution that works in legacy Bourne shells, you might prefer to use an external tool's regular expression syntax. Here is a portable version (explained in detail here), using egrep:

# Bourne
if echo "$foo" | egrep '^[-+]?([0-9]+\.?|[0-9]*\.[0-9]+)$' >/dev/null
then
    echo "'$foo' is a number"
else
    echo "'$foo' is not a number"
fi

Bash version 3 and above have regular expression support in the [[ command. Due to bugs and changes in the implementation of the =~ feature throughout bash 3.x, we do not recommend using it, but people do it anyway, so we have to maintain this example (and keep restoring this warning, too, when people delete it):

# Bash
# Put the RE in a var for backward compatibility with versions <3.2
regexp='^[-+]?[0-9]*(\.[0-9]*)?$' 
if [[ $foo = *[0-9]* && $foo =~ $regexp ]]; then
    echo "'$foo' looks rather like a number"
else
    echo "'$foo' doesn't look particularly numeric to me"
fi

Using the parsing done by [ and printf (or "using eq")

# fails with ksh
if [ "$foo" -eq "$foo" ] 2>/dev/null;then
 echo "$foo is an integer"
fi

[ parses the variable and interprets it as in integer because of the -eq. If the parsing succeds the test is trivially true; if it fails [ prints an error message that 2>/dev/null hides and sets a status different from 0. However this method fails if the shell is ksh, because ksh evaluates the variable as an arithmetic expression.

You can use a similar trick with printf:

# POSIX
if printf "%f" "$foo" >/dev/null 2>&1; then
  echo "$foo is a float"
fi

You can use %d to parse an integer. Take care that the parsing might be (is supposed to be?) locale-dependent.

-  ⇤ ← Revision 1 as of 2007-05-02 23:46:43 → 
  Size: 1429
  Editor: redondos
  Comment:
+   ← Revision 31 as of 2012-10-24 17:34:14 → ⇥
  Size: 4015
  Editor: ormaaj
  Comment: rm the entire "declare -i" section with dangerous examples. declare -i is NOT a method of testing for valid integers. It requires extra care - not recommended for beginners.
-Deletions are marked like this.
+Additions are marked like this.
 Line 1:
-[[Anchor(faq54)]]
+<<Anchor(faq54)>>
 Line 3:
+First, you have to define what you mean by "number".  The most common case when people ask this seems to be "a non-negative integer, with no leading + sign".  Or in other words, a string of all digits.  Other times, people want to validate a floating-point input, with optional sign and optional decimal point.
-Line 4:
+Line 5:
-First, you have to define what you mean by "number".  The most common case seems to be that, when people ask this, they actually mean "a non-negative integer, with no leading + sign".
+=== Hand parsing ===
If you're validating a simple "string of digits", you can do it with a [[glob]]:
-Line 7:
+Line 9:
-if [[ $foo = *[^0-9]* ]]; then
   echo "'$foo' has a non-digit somewhere in it"
+# Bash
if [[ $foo != *[!0-9]* ]]; then
    echo "'$foo' is strictly numeric"
-Line 10:
+Line 13:
-   echo "'$foo' is strictly numeric"
+    echo "'$foo' has a non-digit somewhere in it"
-Line 14:
+Line 17:
-This can be done in Korn and legacy Bourne shells as well, using {{{case}}}:
+The same thing can be done in Korn and POSIX shells as well, using {{{case}}}:
-Line 17:
+Line 20:
+# ksh, POSIX
-Line 23:
+Line 27:
-If what you actually mean is "a valid floating-point number" or something else more complex, then you might prefer to use a regular expression.  Bash version 3 and above have regular expression support in the [[ command:
+If you need to allow a leading negative sign, or if want a valid floating-point number or something else more complex, then there are a few possible ways.  Standard globs aren't expressive enough to do this, but we can use [[glob|extended globs]]:
-Line 26:
+Line 30:
-if [[ $foo =~ ^[-+]?[0-9]+\(\.[0-9]+\)?$ ]]; then
+# Bash -- extended globs must be enabled.
# Check whether the variable is all digits.
shopt -s extglob
[[ $var == +([0-9]) ]]
}}}

A more complex case:

{{{
# Bash
shopt -s extglob
[[ $foo = *[0-9]* && $foo = ?([+-])*([0-9])?(.*([0-9])) ]] &&
  echo "foo is a floating-point number"
}}}

The leading test of {{{$foo}}} is to ensure that it contains at least one digit.  The extended glob, by itself, would match the empty string, or a lone {{{+}}} or {{{-}}}, which may not be desirable behavior.

Korn shell has extended globs enabled by default, but lacks `[[`, so we must use `case` to do the glob-matching:

{{{
# Korn
case $foo in
  *[0-9]*)
    case $foo in
        ?([+-])*([0-9])?(.*([0-9]))) echo "foo is a number";;
    esac;;
esac
}}}

Note that this uses the same extended glob as the Bash example before it; the third closing parenthesis at the end of it is actually part of the case syntax.

If your definition of "a valid number" is even more complex, or if you need a solution that works in legacy Bourne shells, you might prefer to use an external tool's [[RegularExpression|regular expression]] syntax.  Here is a portable version (explained in detail [[http://www.wplug.org/wiki/Meeting-20100612#EXERCISE_TWO|here]]), using {{{egrep}}}:

{{{
# Bourne
if echo "$foo" | egrep '^[-+]?([0-9]+\.?|[0-9]*\.[0-9]+)$' >/dev/null
then
    echo "'$foo' is a number"
else
    echo "'$foo' is not a number"
fi
}}}

Bash version 3 and above have regular expression support in the [[ command.  Due to bugs and changes in the implementation of the `=~` feature throughout bash 3.x, we '''do not recommend''' using it, but people do it anyway, so we have to maintain this example (''and keep restoring this warning, too, when people delete it''):

{{{
# Bash
# Put the RE in a var for backward compatibility with versions <3.2
regexp='^[-+]?[0-9]*(\.[0-9]*)?$' 
if [[ $foo = *[0-9]* && $foo =~ $regexp ]]; then
-Line 33:
+Line 86:
-If you don't have bash version 3, then you would use {{{egrep}}}:
+=== Using the parsing done by [ and printf (or "using eq") ===
-Line 36:
+Line 89:
-if echo "$foo" | egrep '^[-+]?[0-9]+(\.[0-9]+)?$' >/dev/null; then
    echo "'$foo' might be a number"
else
    echo "'$foo' might not be a number"
+# fails with ksh
if [ "$foo" -eq "$foo" ] 2>/dev/null;then
 echo "$foo is an integer"
fi
}}} 

`[` parses the variable and interprets it as in integer because of the `-eq`. If the parsing succeds the test is trivially true; if it fails `[` prints an error message that `2>/dev/null` hides and sets a status different from 0.  However this method fails if the shell is ksh, because ksh evaluates the variable as an arithmetic expression.

You can use a similar trick with `printf`:
{{{
# POSIX
if printf "%f" "$foo" >/dev/null 2>&1; then
  echo "$foo is a float"
-Line 43:
+Line 105:
-Note that the parentheses in the {{{egrep}}} regular expression don't require backslashes in front of them, whereas the ones in the bash3 command do.
+You can use `%d` to parse an integer.  Take care that the parsing might be (is supposed to be?) [[locale]]-dependent.