cqlsh: the CQL shell
cqlsh
is a command-line interface for interacting with Cassandra using CQL (the Cassandra Query Language).
It is shipped with every Cassandra package, and can be found in the bin/ directory alongside the cassandra
executable.
cqlsh
is implemented with the Python native protocol driver, and connects to the single specified node.
Compatibility
cqlsh
is compatible with Python 2.7.
In general, a given version of cqlsh
is only guaranteed to work with the
version of Cassandra that it was released with.
In some cases, cqlsh
may work with older or newer versions of Cassandra, but this is not
officially supported.
Optional Dependencies
cqlsh
ships with all essential dependencies. However, there are some
optional dependencies that can be installed to improve the capabilities
of cqlsh
.
cython
The performance of cqlsh’s COPY
operations can be improved by
installing cython. This will compile the python
modules that are central to the performance of COPY
.
cqlshrc
The cqlshrc
file holds configuration options for cqlsh
.
By default, the file is locagted the user’s home directory at ~/.cassandra/cqlsh
, but a
custom location can be specified with the --cqlshrc
option.
Example config values and documentation can be found in the
conf/cqlshrc.sample
file of a tarball installation.
You can also view the latest version of the
cqlshrc file online.
credentials
The credentials file contains a user-name and a password for cqlsh.
A user-name and a password must be located in a section whose name matches the classname from the [auth_provider]
section of the cqlshrc file.
The credentials file must be owned by the user and no one else has permission to read the file.
Example config values and documentation can be found in the
conf/credentials.sample
file of a tarball installation.
cql history
All CQL commands you execute are written to a history file. By default, CQL history will be written to ~/.cassandra/cql_history
. You can change this default by setting the environment variable CQL_HISTORY
like ~/some/other/path/to/cqlsh_history
where cqlsh_history
is a file. All parent directories to history file will be created if they do not exist. If you do not want to persist history, you can do so by setting CQL_HISTORY to /dev/null.
This feature is supported from Cassandra 4.1.
Command Line Options
Usage: cqlsh.py [options] [host [port]]
CQL Shell for Apache Cassandra
Options:
--version
-
show program’s version number and exit
-h
--help
-
show this help message and exit
-C
--color
-
Always use color output
--no-color
-
Never use color output
--browser=BROWSER
-
The browser to use to display CQL help, where BROWSER can be: one of the supported browsers in docs.python.org/3/library/webbrowser.html. browser path followed by %s, example: /usr/bin/google-chrome-stable %s
--ssl
-
Use SSL
-u USERNAME
--username=USERNAME
-
Authenticate as user.
-p PASSWORD
--password=PASSWORD
-
Authenticate using password.
-k KEYSPACE
--keyspace=KEYSPACE
-
Authenticate to the given keyspace.
-f FILE
--file=FILE
-
Execute commands from FILE, then exit
--debug
-
Show additional debugging information
--coverage
-
Collect coverage data
--encoding=ENCODING
-
Specify a non-default encoding for output. (Default: utf-8)
--cqlshrc=CQLSHRC
-
Specify an alternative cqlshrc file location.
--credentials=CREDENTIALS
-
Specify an alternative credentials file location.
--cqlversion=CQLVERSION
-
Specify a particular CQL version, by default the highest version supported by the server will be used. Examples: "3.0.3", "3.1.0"
--protocol-version=PROTOCOL_VERSION
-
Specify a specific protcol version otherwise the client will default and downgrade as necessary
-e EXECUTE
--execute=EXECUTE
-
Execute the statement and quit.
--connect-timeout=CONNECT_TIMEOUT
-
Specify the connection timeout in seconds (default: 5 seconds).
--request-timeout=REQUEST_TIMEOUT
-
Specify the default request timeout in seconds (default: 10 seconds).
-t, --tty
-
Force tty mode (command prompt).
-v
--v
-
Print the current version of cqlsh.
Special Commands
In addition to supporting regular CQL statements, cqlsh
also supports a
number of special commands that are not part of CQL. These are detailed
below.
CONSISTENCY
Usage
: CONSISTENCY <consistency level>
Sets the consistency level for operations to follow. Valid arguments include:
-
ANY
-
ONE
-
TWO
-
THREE
-
QUORUM
-
ALL
-
LOCAL_QUORUM
-
LOCAL_ONE
-
SERIAL
-
LOCAL_SERIAL
SERIAL CONSISTENCY
Usage
: SERIAL CONSISTENCY <consistency level>
Sets the serial consistency level for operations to follow. Valid arguments include:
-
SERIAL
-
LOCAL_SERIAL
The serial consistency level is only used by conditional updates
(INSERT
, UPDATE
and DELETE
with an IF
condition). For those, the
serial consistency level defines the consistency level of the serial
phase (or “paxos” phase) while the normal consistency level defines the
consistency for the “learn” phase, i.e. what type of reads will be
guaranteed to see the update right away. For example, if a conditional
write has a consistency level of QUORUM
(and is successful), then a
QUORUM
read is guaranteed to see that write. But if the regular
consistency level of that write is ANY
, then only a read with a
consistency level of SERIAL
is guaranteed to see it (even a read with
consistency ALL
is not guaranteed to be enough).
SHOW VERSION
Prints the cqlsh
, Cassandra, CQL, and native protocol versions in use.
Example:
cqlsh> SHOW VERSION
[cqlsh 5.0.1 | Cassandra 3.8 | CQL spec 3.4.2 | Native protocol v4]
SHOW HOST
Prints the IP address and port of the Cassandra node that cqlsh
is
connected to in addition to the cluster name. Example:
cqlsh> SHOW HOST
Connected to Prod_Cluster at 192.0.0.1:9042.
SHOW SESSION
Pretty prints a specific tracing session.
Usage
: SHOW SESSION <session id>
Example usage:
cqlsh> SHOW SESSION 95ac6470-327e-11e6-beca-dfb660d92ad8
Tracing session: 95ac6470-327e-11e6-beca-dfb660d92ad8
activity | timestamp | source | source_elapsed | client
-----------------------------------------------------------+----------------------------+-----------+----------------+-----------
Execute CQL3 query | 2016-06-14 17:23:13.979000 | 127.0.0.1 | 0 | 127.0.0.1
Parsing SELECT * FROM system.local; [SharedPool-Worker-1] | 2016-06-14 17:23:13.982000 | 127.0.0.1 | 3843 | 127.0.0.1
...
SOURCE
Reads the contents of a file and executes each line as a CQL statement or special cqlsh command.
Usage
: SOURCE <string filename>
Example usage:
cqlsh> SOURCE '/home/calvinhobbs/commands.cql'
CAPTURE
Begins capturing command output and appending it to a specified file. Output will not be shown at the console while it is captured.
Usage
:
CAPTURE '<file>';
CAPTURE OFF;
CAPTURE;
That is, the path to the file to be appended to must be given inside a
string literal. The path is interpreted relative to the current working
directory. The tilde shorthand notation ('~/mydir'
) is supported for
referring to $HOME
.
Only query result output is captured. Errors and output from cqlsh-only commands will still be shown in the cqlsh session.
To stop capturing output and show it in the cqlsh session again, use
CAPTURE OFF
.
To inspect the current capture configuration, use CAPTURE
with no
arguments.
HELP
Gives information about cqlsh commands. To see available topics, enter
HELP
without any arguments. To see help on a topic, use
HELP <topic>
. Also see the --browser
argument for controlling what
browser is used to display help.
TRACING
Enables or disables tracing for queries. When tracing is enabled, once a query completes, a trace of the events during the query will be printed.
Usage
:
TRACING ON
TRACING OFF
PAGING
Enables paging, disables paging, or sets the page size for read queries. When paging is enabled, only one page of data will be fetched at a time and a prompt will appear to fetch the next page. Generally, it’s a good idea to leave paging enabled in an interactive session to avoid fetching and printing large amounts of data at once.
Usage
:
PAGING ON
PAGING OFF
PAGING <page size in rows>
EXPAND
Enables or disables vertical printing of rows. Enabling EXPAND
is
useful when many columns are fetched, or the contents of a single column
are large.
Usage
:
EXPAND ON
EXPAND OFF
LOGIN
Authenticate as a specified Cassandra user for the current session.
Usage
:
LOGIN <username> [<password>]
DESCRIBE
Prints a description (typically a series of DDL statements) of a schema element or the cluster. This is useful for dumping all or portions of the schema.
Usage
:
DESCRIBE CLUSTER
DESCRIBE SCHEMA
DESCRIBE KEYSPACES
DESCRIBE KEYSPACE <keyspace name>
DESCRIBE TABLES
DESCRIBE TABLE <table name>
DESCRIBE INDEX <index name>
DESCRIBE MATERIALIZED VIEW <view name>
DESCRIBE TYPES
DESCRIBE TYPE <type name>
DESCRIBE FUNCTIONS
DESCRIBE FUNCTION <function name>
DESCRIBE AGGREGATES
DESCRIBE AGGREGATE <aggregate function name>
In any of the commands, DESC
may be used in place of DESCRIBE
.
The DESCRIBE CLUSTER
command prints the cluster name and partitioner:
cqlsh> DESCRIBE CLUSTER
Cluster: Test Cluster
Partitioner: Murmur3Partitioner
The DESCRIBE SCHEMA
command prints the DDL statements needed to
recreate the entire schema. This is especially useful for dumping the
schema in order to clone a cluster or restore from a backup.
COPY TO
Copies data from a table to a CSV file.
Usage
:
COPY <table name> [(<column>, ...)] TO <file name> WITH <copy option> [AND <copy option> ...]
If no columns are specified, all columns from the table will be copied to the CSV file. A subset of columns to copy may be specified by adding a comma-separated list of column names surrounded by parenthesis after the table name.
The <file name>
should be a string literal (with single quotes)
representing a path to the destination file. This can also the special
value STDOUT
(without single quotes) to print the CSV to stdout.
See shared-copy-options
for options that apply to both COPY TO
and
COPY FROM
.
Options for COPY TO
MAXREQUESTS
-
The maximum number token ranges to fetch simultaneously. Defaults to 6.
PAGESIZE
-
The number of rows to fetch in a single page. Defaults to 1000.
PAGETIMEOUT
-
By default the page timeout is 10 seconds per 1000 entries in the page size or 10 seconds if pagesize is smaller.
BEGINTOKEN
,ENDTOKEN
-
Token range to export. Defaults to exporting the full ring.
MAXOUTPUTSIZE
-
The maximum size of the output file measured in number of lines; beyond this maximum the output file will be split into segments. -1 means unlimited, and is the default.
ENCODING
-
The encoding used for characters. Defaults to
utf8
.
COPY FROM
Copies data from a CSV file to table.
Usage
:
COPY <table name> [(<column>, ...)] FROM <file name> WITH <copy option> [AND <copy option> ...]
If no columns are specified, all columns from the CSV file will be copied to the table. A subset of columns to copy may be specified by adding a comma-separated list of column names surrounded by parenthesis after the table name.
The <file name>
should be a string literal (with single quotes)
representing a path to the source file. This can also the special value
STDIN
(without single quotes) to read the CSV data from stdin.
See shared-copy-options
for options that apply to both COPY TO
and
COPY FROM
.
Options for COPY TO
INGESTRATE
-
The maximum number of rows to process per second. Defaults to 100000.
MAXROWS
-
The maximum number of rows to import. -1 means unlimited, and is the default.
SKIPROWS
-
A number of initial rows to skip. Defaults to 0.
SKIPCOLS
-
A comma-separated list of column names to ignore. By default, no columns are skipped.
MAXPARSEERRORS
-
The maximum global number of parsing errors to ignore. -1 means unlimited, and is the default.
MAXINSERTERRORS
-
The maximum global number of insert errors to ignore. -1 means unlimited. The default is 1000.
ERRFILE
=-
A file to store all rows that could not be imported, by default this is
import_<ks>_<table>.err
where<ks>
is your keyspace and<table>
is your table name. MAXBATCHSIZE
-
The max number of rows inserted in a single batch. Defaults to 20.
MINBATCHSIZE
-
The min number of rows inserted in a single batch. Defaults to 2.
CHUNKSIZE
-
The number of rows that are passed to child worker processes from the main process at a time. Defaults to 1000.
Shared COPY Options
Options that are common to both COPY TO
and COPY FROM
.
NULLVAL
-
The string placeholder for null values. Defaults to
null
. HEADER
-
For
COPY TO
, controls whether the first line in the CSV output file will contain the column names. For COPY FROM, specifies whether the first line in the CSV input file contains column names. Defaults tofalse
. DECIMALSEP
-
The character that is used as the decimal point separator. Defaults to
.
. THOUSANDSSEP
-
The character that is used to separate thousands. Defaults to the empty string.
BOOLSTYlE
-
The string literal format for boolean values. Defaults to
True,False
. NUMPROCESSES
-
The number of child worker processes to create for
COPY
tasks. Defaults to a max of 4 forCOPY FROM
and 16 forCOPY TO
. However, at most (num_cores - 1) processes will be created. MAXATTEMPTS
-
The maximum number of failed attempts to fetch a range of data (when using
COPY TO
) or insert a chunk of data (when usingCOPY FROM
) before giving up. Defaults to 5. REPORTFREQUENCY
-
How often status updates are refreshed, in seconds. Defaults to 0.25.
RATEFILE
-
An optional file to output rate statistics to. By default, statistics are not output to a file.