Bash conventions¶
Follow bertvv’s Bash cheat sheet, the Google shell guide, and this guide’s rules (below). These three are mostly complimentary, but there are a few disagreements; this guide’s rules take precedence.
What are the specific disagreements?
- Prefer
$varover${var}. (bertvv’s cheat sheet mandates{var}.) - Use
stderrfor all logging; use stdout only for pipeable output. (The Google guide is unclear on this.) - Choose a max line length per your discretion. (The Google guide requires 80.)
- Format function headers (comments) as you see fit. (Google doesn’t specify a format, but the examples use one.)
TODO: ...is sufficient for TODO comment. (Google requiresTODO(author):, but assume the script is under version control.) UseFIXME:for unfinished or placeholder code.
Additionally, refer to:
- CLI conventions – arguments
- CLI conventions – exit codes
- CLI conventions – standard streams
- CLI conventions – standard paths
Banned Bash features¶
- Use
printf, notecho. - Use
[[ ]], not[ ]. - Use
$( ), not backticks. - Use
(( ))for numeric comparisons. - Use
[[ -z "$var" ]]and[[ -n "$var" ]]rather than[[ "$var" == "" ]]and[[ "$var != "" ]].
Shebang, header, and set options¶
Start every file like this. For more on the SPDX headers and why they matter, see the “Source Headers” in the maintainer guide.
#!/usr/bin/env bash #(1)!
# SPDX-FileCopyrightText: Copyright {date(s)}, Contributors to {project}
# SPDX-PackageHomePage: {https uri}
# SPDX-License-Identifier: {spdx id}
set -o errexit -o nounset -o pipefail # "strict mode"
- Use
/usr/bin/envto honor the user’s environment./bin/bash,/usr/bin/bash, and/usr/local/bin/bashcould all be different.
The set -o options are important. With IFS=$'\n\t', these options are sometimes called Bash Strict Mode. IFS=$'\n\t' is omitted here because it can make code brittle.
Importantly: When writing your code, pretend these options are not enabled. This forces you to handle errors explicitly, with proper comments, error codes, and error messages. It also makes the code more interoperable and shareable.
Warning
Why you should be vigilant about quoting variables: Just search for GitHub issues about unquoted shell variables.
Variables¶
Use UPPERCASE_SNAKE_CASE for environment variables and lowercase_snake_case for others. Prefer $var; use ${var} only when needed.
Rationale
I recently (2024-08) changed my mind on $var vs. ${var} per less > more. Decent syntax highlighters already distinguish variables clearly.
Logging format¶
For longer-running scripts, indicate the level of each logging statement (e.g. [INFO]). Use the levels DEBUG, INFO, WARN, and ERROR, with INFO as the default level. Other structured data may be included (e.g. date-time, time elapsed, % progress, thread id).
If desired, you can distinguish logging levels with ANSI color and/or style codes. Refer to CLI conventions – color for details. See todos.sh for an example.
Example log file
Standard paths¶
Use the XDG Base Directory Standard. For example:
Important: Remember that ~ doesn’t expand inside double quotes. Always use $HOME inside scripts instead.
Full XDG directory handling
You can use the following for full parsing of XDG_DATA_HOME, XDG_CONFIG_HOME, XDG_STATE_HOME, XDG_CACHE_HOME, XDG_DATA_DIRS, XDG_CONFIG_DIRS, ~/.local/bin, and XDG_RUNTIME_DIR.
declare -r app=...
# `XDG_*_HOME` variables:
declare -r data_home="${XDG_DATA_HOME:-$HOME/.local/share}"/"$app"
declare -r config_home="${XDG_CONFIG_HOME:-$HOME/.config}"/"$app"
declare -r state_home="${XDG_STATE_HOME:-$HOME/.local/state}"/"$app"
declare -r cache_home="${XDG_CACHE_HOME:-$HOME/.cache}"/"$app"
# `:`-separated config dirs (in order of decreasing priority):
IFS=: read -a data_dirs <<< "${XDG_DATA_DIRS:-/usr/local/share:/usr/share}"
for i in "${!data_dirs[@]}"; do
data_dirs[i]="${data_dirs[i]}"/"$app"
done
# `:`-separated config dirs (in order of decreasing priority):
IFS=: read -a config_dirs <<< "${XDG_CONFIG_DIRS:-/etc/xdg}"
for i in "${!config_dirs[@]}"; do
config_dirs[i]="${config_dirs[i]}"/"$app"
done
# User executables (which doesn't have a corresponding XDG env var):
declare -r user_bin="$HOME/.local/bin"
# `XDG_RUNTIME_DIR`, a per-user temp dir (normally `/run/{user-id}/`):
declare -r runtime_dir="${XDG_RUNTIME_DIR:-${TMPDIR:-/tmp}/$(id -u)}"
if [[ -z "$XDG_RUNTIME_DIR" ]]; then
printf >&2 "[WARN] XDG_RUNTIME_DIR is not set; defaulting to '%s'.\n" "$runtime_dir"
[[ ! -e "$runtime_dir" ]] && mkdir "$runtime_dir"
fi
Script directory and name¶
Rationale
This is the only reasonable solution that:
- Works on both Linux and BSD(/macOS); and
- works even if the script path is relative or absolute; is a symlink or contains a symlinked component; contains
/../,/./, or//; wassource-ed; lives inside the current CWD; contains a space; starts with-; or whose final component starts with-.
Note: This even works with paths that contain [*?], [|&<>],[‘“], and even\` and possibly control chars. However, please, please don’t use such characters in filenames.
Rely on BASH_SOURCE and realpath. Although realpath is not POSIX, it is available on all modern Linux and BSD/macOS distributions.
Use the following definitions as needed, where my_namespace is a namespace you expect would be unique if the script is sourced. If the script is not intended to be sourced, using e.g. script_name is ok.
my_namespace__path="$(realpath -- "${BASH_SOURCE[0]}" || exit $?)"
declare -r my_namespace__name="${my_namespace__path##*/}"
declare -r my_namespace__dir="${my_namespace__path%/*}"
unset my_namespace__path # (1)!
- Assuming it’s not needed.
Make sure to use -- with realpath to protect paths that start with -. (Don’t use -e because it is GNU-specific and not necessary.) Because realpath returns canonical paths, the Bash parameter expansions work perfectly, and dirname/basename are not needed.
GNU vs. BSD realpath in other contexts
GNU/Linux and BSD(/macOS) realpath differ. Most importantly, BSD’s will error if a path component is not found. That’s a non-issue here because ${BASH_SOURCE[0]} must exist, but this difference could cause problems elsewhere.
Pass -e/--canonicalize-missing to GNU realpath to require that components exist. Don’t rely on "$(uname)" == Linux or "$OSTYPE" == linux-gnu because, for example, realpath might be aliased to Brew coreutils realpath on macOS. This snippet is a reasonable solution:
Definitions¶
Refer to the Google style guide.
Make variables read-only whenever possible, using declare -r. For functions, use readonly -f (or, equivalently,declare -r -g -f).
Warning
Always use readonly -f after defining the function.
As per the Google guide, choose a namespace and prefix function names with namespace::. Prefix non-exported global variables with namespace__. You may omit these prefixes if the script is not intended to be sourced.
Warning
Name collisions in Bash are usually silent.
The following example illustrates why the above rules are important.
library.sh
cli.sh
Note that run_lib calls the count redefined by its caller, and that cli.sh uses the script name of lib.sh.
Prevent this by marking variables and functions readonly and using namespace prefixes.
Parsing command-line arguments¶
If your script doesn’t define options except -h/--help you can parse the arguments directly with $1, etc. The example below handles -h and --help correctly. Otherwise, use a case statement; see todos.sh as an example.
Rationale
getopts, as well the variant of getopt on BSD, cannot parse –long-style options. getopt has other differences between distributions, as well.
Formatting and comments¶
Line breaks¶
Choose a max line length; recommended choices are ≤80 and ≤100. For long or complex lines, put each argument on its own line (“chomping” style), and otherwise be thoughtful and consistent about where you break lines.
Comments¶
Use comments as needed, not out of habit. Above non-trivial functions, add a header that lists the arguments, environment variables, return codes, etc. It’s ok to be brief or to refer to other documentation.
TODO and FIXME¶
Use TODO: short description for minor issues, and FIXME: short description for things like unfinished or placeholder code.
Example¶
#!/usr/bin/env bash
# SPDX-FileCopyrightText: Copyright 2024, Contributors to the gamma package
# SPDX-PackageHomePage: https://github.com/the-gamma-people/gamma
# SPDX-License-Identifier: Apache-2.0
set -o errexit -o nounset -o pipefail # "strict mode"
script_path="$(realpath -- "${BASH_SOURCE[0]}" || exit $?)"
declare -r script_name="${script_path##*/}"
declare -r -i default_retries=2
declare -r description="Checks network access by pinging google."
declare -r usage="Usage: $script_name [number of pings (default: $default_retries)]"
log_info() {
printf >&2 '[INFO] %s\n' "$1" # (1)!
}
log_error() {
# Highlight errors in red if we're on a TTY.
if [[ -t 2 ]]; then
printf >&2 '\e[0;31m[ERROR] %s\e[0m\n' "$1"
else
printf >&2 '[ERROR] %s\n' "$1"
fi
}
if (($# == 1)) && [[ "$1" == "-h" || "$1" == "--help" ]]; then
printf '%s\n%s\n' "$description" "$usage"
exit 0
fi
if (($# > 1)); then
log_error "Invalid usage."
printf '%s\n' "$usage"
exit 2 # (2)!
fi
declare -r -i count=$(("${1:-}" || $default_count))
do_ping() {
ping -c "$count" google.com > /dev/null 2> /dev/null || return $?
}
main() {
log_info "Starting..."
if ! do_ping; then
log_error "Cannot ping Google. Check cables?"
return 1
fi
log_info "Done."
}
main || exit 1 # (3)!
# Alternative: main || exit $? # (4)!
- Always log to stderr; stdout is for machine-readable output.
exit 2for usage errors.- If
mainerrored, exit 1. - You can use
|| exit $?to forwardmain’s return code.
*[TTY]: TeleTYpe device