Diff for "BashFAQ/054"

Differences between revisions 32 and 36 (spanning 4 versions)

How can I tell whether a variable contains a valid number?

First, you have to define what you mean by "number". The most common case when people ask this seems to be "a non-negative integer, with no leading + sign". Or in other words, a string of all digits. Other times, people want to validate a floating-point input, with optional sign and optional decimal point.

Hand parsing

If you're validating a simple "string of digits", you can do it with a glob:

# Bash
if [[ $foo != *[!0-9]* ]]; then
    echo "'$foo' is strictly numeric"
else
    echo "'$foo' has a non-digit somewhere in it"
fi

The same thing can be done in Korn and POSIX shells as well, using case:

# POSIX
case $var in
    *[!0-9]*|'')
        printf '%s has a non-digit somewhere in it\' "$var"
        ;;
    *)
        printf '%s is strictly numeric\n' "$var"
esac >&2

If you need to allow a leading negative sign, or if want a valid floating-point number or something else more complex, then there are a few possible ways. Standard globs aren't expressive enough to do this, but we can use extended globs:

# Bash -- extended globs must be enabled explicitly in versions prior to 4.1.
# Check whether the variable is all digits.
shopt -s extglob
[[ $var == +([0-9]) ]]

A more complex case:

# Bash / ksh
shopt -s extglob

if [[ $foo = *[0-9]* && $foo = ?([+-])*([0-9])?(.*([0-9])) ]]; then
  echo 'foo is a floating-point number'
fi

Optionally, case..esac may have been used in shells with extended pattern matching. The leading test of $foo is to ensure that it contains at least one digit. The extended glob, by itself, would match the empty string, or a lone + or -, which may not be desirable behavior.

If your definition of "a valid number" is even more complex, or if you need a solution that works in legacy Bourne shells, you might prefer to use an external tool's regular expression syntax. Here is a portable version (explained in detail here), using egrep:

# Bourne
if echo "$foo" | grep -qE '^[-+]?([0-9]+\.?|[0-9]*\.[0-9]+)$'; then
    echo "'$foo' is a number"
else
    echo "'$foo' is not a number"
fi

Bash version 3 and above have regular expression support in the [[ command.

# Bash
# The regexp must be stored in a var and expanded for backward compatibility with versions < 3.2

regexp='^[-+]?[0-9]*(\.[0-9]*)?$'
if [[ $foo = *[0-9]* && $foo =~ $regexp ]]; then
    echo "'$foo' looks rather like a number"
else
    echo "'$foo' doesn't look particularly numeric to me"
fi

Using the parsing done by [ and printf (or "using eq")

# fails with ksh
if [ "$foo" -eq "$foo" ] 2>/dev/null; then
 echo "$foo is an integer"
fi

[ parses the variable and interprets it as an integer because of the -eq. If the parsing succeeds the test is trivially true; if it fails [ prints an error message that 2>/dev/null hides and sets a status different from 0. However this method fails if the shell is ksh, because ksh evaluates the variable as an arithmetic expression.

You can use a similar trick with printf, but this won't work in all shells either:

# BASH
if printf %f "$foo" >/dev/null 2>&1; then
  echo "$foo is a float"
fi

You can use %d to parse an integer. Take care that the parsing might be (is supposed to be?) locale-dependent.

-  ⇤ ← Revision 32 as of 2012-12-02 02:34:38 → 
  Size: 4278
  Editor: S01060048548a70fe
  Comment:
+   ← Revision 36 as of 2014-05-02 14:28:21 → ⇥
  Size: 3574
  Editor: ormaaj
  Comment: Fix some bugs. ksh has always supported [[. rm'd example. =~ shouldn't be shunned due to bash 3's bad implementation. rm'd (again). =~ (and [[) will quite likely be POSIX at some point.
-Deletions are marked like this.
+Additions are marked like this.
 Line 2:
-Line 20:
+Line 19:
-# ksh, POSIX
case "$foo" in
    *[!0-9]*) echo "'$foo' has a non-digit somewhere in it" ;;
    *) echo "'$foo' is strictly numeric" ;;
esac
+# POSIX
case $var in
    *[!0-9]*|'')
        printf '%s has a non-digit somewhere in it\' "$var"
        ;;
    *)
        printf '%s is strictly numeric\n' "$var"
esac >&2
-Line 29:
+Line 31:
-# Bash -- extended globs must be enabled.
+# Bash -- extended globs must be enabled explicitly in versions prior to 4.1.
-Line 34:
+Line 36:
-Line 37:
+Line 40:
-# Bash
+# Bash / ksh
-Line 39:
+Line 42:
-[[ $foo = *[0-9]* && $foo = ?([+-])*([0-9])?(.*([0-9])) ]] &&
  echo "foo is a floating-point number"
+if [[ $foo = *[0-9]* && $foo = ?([+-])*([0-9])?(.*([0-9])) ]]; then
  echo 'foo is a floating-point number'
fi
-Line 42:
+Line 47:
-The leading test of {{{$foo}}} is to ensure that it contains at least one digit.  The extended glob, by itself, would match the empty string, or a lone {{{+}}} or {{{-}}}, which may not be desirable behavior.
-Line 44:
+Line 48:
-Korn shell has extended globs enabled by default, but lacks `[[`, so we must use `case` to do the glob-matching:
+Optionally, `case..esac` may have been used in shells with extended pattern matching. The leading test of {{{$foo}}} is to ensure that it contains at least one digit.  The extended glob, by itself, would match the empty string, or a lone {{{+}}} or {{{-}}}, which may not be desirable behavior.
-Line 46:
+Line 50:
-{{{
# Korn
case $foo in
  *[0-9]*)
    case $foo in
        ?([+-])*([0-9])?(.*([0-9]))) echo "foo is a number";;
    esac;;
esac
}}}
Note that this uses the same extended glob as the Bash example before it; the third closing parenthesis at the end of it is actually part of the case syntax.
-Line 61:
+Line 55:
-if echo "$foo" | egrep '^[-+]?([0-9]+\.?|[0-9]*\.[0-9]+)$' >/dev/null
then
+if echo "$foo" | grep -qE '^[-+]?([0-9]+\.?|[0-9]*\.[0-9]+)$'; then
-Line 68:
+Line 61:
-Bash version 3 and above have regular expression support in the [[ command.  Due to bugs and changes in the implementation of the `=~` feature throughout bash 3.x, we '''do not recommend''' using it, but people do it anyway, so we have to maintain this example (''and keep restoring this warning, too, when people delete it''):
+Bash version 3 and above have regular expression support in the [[ command.
-Line 72:
+Line 65:
-# Put the RE in a var for backward compatibility with versions <3.2
+# The regexp must be stored in a var and expanded for backward compatibility with versions < 3.2
-Line 80:
+Line 74:
-Line 83:
+Line 78:
-if [ "$foo" -eq "$foo" ] 2>/dev/null;then
+if [ "$foo" -eq "$foo" ] 2>/dev/null; then
-Line 87:
+Line 82:
-`[` parses the variable and interprets it as in integer because of the `-eq`. If the parsing succeds the test is trivially true; if it fails `[` prints an error message that `2>/dev/null` hides and sets a status different from 0.  However this method fails if the shell is ksh, because ksh evaluates the variable as an arithmetic expression.
+`[` parses the variable and interprets it as an integer because of the `-eq`. If the parsing succeeds the test is trivially true; if it fails `[` prints an error message that `2>/dev/null` hides and sets a status different from 0.  However this method fails if the shell is ksh, because ksh evaluates the variable as an arithmetic expression.
-Line 89:
+Line 84:
-You can use a similar trick with `printf`:
+You can use a similar trick with `printf`, but this won't work in all shells either:
-Line 92:
+Line 87:
-# POSIX
if printf "%f" "$foo" >/dev/null 2>&1; then
+# BASH
if printf %f "$foo" >/dev/null 2>&1; then
-Line 97:
+Line 92:
-Line 98:
+Line 94:
-=== Using ((...)) to determine if input is integer ===
{{{
if (("$foo")) >/dev/null 2>&1; then
  echo "$foo is an integer"
fi
}}}
the  command (($foo)) tries to parse foo as an integer arithmetic  expression, which returns a non-zero error code if foo is not an  integer.