Virtual Tables
Virtual Tables let a developer provide an underlying table implementations, while still presenting a normal SQL interface to the user. The person writing SQL doesn’t need to know or care that some of the tables come from elsewhere.
Some examples of how you might use this:
Translating to/from information stored in other formats
Accessing the data remotely (eg you could make a table that backends into the cloud)
Dynamic information (eg currently running processes, files and directories, objects in your program)
There are other examples on the SQLite page
Tip
You’ll find initial development a lot quicker by using
apsw.ext.make_virtual_module()
which lets you
export a Python function as a virtual table, being
able to provide positional and keyword arguments as
part of your query.
See the example.
To write a virtual table, you need to have 3 types of object. A
module
providing the module, a virtual table
,
and a cursor
that moves through a table.
IndexInfo class
- class apsw.IndexInfo
IndexInfo represents the sqlite3_index_info and associated methods used in the
VTTable.BestIndexObject()
method.Naming is identical to the C structure rather than Pythonic. You can access members directly while needing to use get/set methods for array members.
You will get
InvalidContextError
if you use the object outside of an BestIndex method.apsw.ext.index_info_to_dict()
provides a convenient representation of this object as adict
.
- IndexInfo.colUsed: set[int]
(Read-only) Columns used by the statement. Note that a set is returned, not the underlying integer.
- IndexInfo.distinct: int
(Read-only) How the query planner would like output ordered if the query is using group by or distinct.
Calls: sqlite3_vtab_distinct
- IndexInfo.get_aConstraintUsage_argvIndex(which: int) int
Returns argvIndex for aConstraintUsage[which]
- IndexInfo.get_aConstraintUsage_in(which: int) bool
Returns True if the constraint is in - eg column in (3, 7, 9)
Calls: sqlite3_vtab_in
- IndexInfo.get_aConstraint_collation(which: int) str
Returns collation name for aConstraint[which]
Calls: sqlite3_vtab_collation
- IndexInfo.get_aConstraint_rhs(which: int) SQLiteValue
Returns right hand side value if known, else None.
Calls: sqlite3_vtab_rhs_value
- IndexInfo.idxFlags: int
Mask of
SQLITE_INDEX_SCAN flags
- IndexInfo.set_aConstraintUsage_argvIndex(which: int, argvIndex: int) None
Sets argvIndex for aConstraintUsage[which]
- IndexInfo.set_aConstraintUsage_in(which: int, filter_all: bool) None
If which is an in constraint, and filter_all is True then your
VTCursor.Filter()
method will have all of the values at once.Calls: sqlite3_vtab_in
VTModule class
- class apsw.VTModule
Note
There is no actual VTModule class - it is shown this way for documentation convenience and is present as a typing protocol.
A module instance is used to create the virtual tables. Once you have
a module object, you register it with a connection by calling
Connection.create_module()
:
# make an instance
mymod=MyModuleClass()
# register the vtable on connection con
con.create_module("modulename", mymod)
# tell SQLite about the table
con.execute("create VIRTUAL table tablename USING modulename('arg1', 2)")
The create step is to tell SQLite about the existence of the table. Any number of tables referring to the same module can be made this way.
- VTModule.Connect(connection: Connection, modulename: str, databasename: str, tablename: str, *args: tuple[SQLiteValue, ...]) tuple[str, VTTable]
The parameters and return are identical to
Create()
. This method is called when there are additional references to the table.Create()
will be called the first time andConnect()
after that.The advise is to create caches, generated data and other heavyweight processing on
Create()
calls and then find and reuse that on the subsequentConnect()
calls.The corresponding call is
VTTable.Disconnect()
. If you have a simple virtual table implementation, then just setConnect()
to be the same asCreate()
:class MyModule: def Create(self, connection, modulename, databasename, tablename, *args): # do lots of hard work Connect=Create
- VTModule.Create(connection: Connection, modulename: str, databasename: str, tablename: str, *args: tuple[SQLiteValue, ...]) tuple[str, VTTable]
Called when a table is first created on a
connection
.- Parameters:
connection – An instance of
Connection
modulename – The string name under which the module was
registered
databasename – The name of the database. main, temp, the name in ATTACH
tablename – Name of the table the user wants to create.
args – Any arguments that were specified in the create virtual table statement.
- Returns:
A list of two items. The first is a SQL create table statement. The columns are parsed so that SQLite knows what columns and declared types exist for the table. The second item is an object that implements the
table
methods.
The corresponding call is
VTTable.Destroy()
.
- VTModule.ShadowName(table_suffix: str) bool
This method is called to check if table_suffix is a shadow name
The default implementation always returns False.
If a virtual table is created using this module named
example
and then a real table is created namedexample_content
, this would be called with a table_suffix ofcontent
VTTable class
- class apsw.VTTable
Note
There is no actual VTTable class - it is shown this way for documentation convenience and is present as a typing protocol.
The
VTTable
object contains knowledge of the indices, makes cursors and can perform transactions.A virtual table is structured as a series of rows, each of which has the same number of columns. The value in a column must be one of the 5 supported types, but the type can be different between rows for the same column. The virtual table routines identify the columns by number, starting at zero.
Each row has a unique 64 bit integer rowid with the
Cursor
routines operating on this number, as well as some of theTable
routines such asUpdateChangeRow
.It is possible to not have a rowid
- VTTable.Begin() None
This function is used as part of transactions. You do not have to provide the method.
- VTTable.BestIndex(constraints: Sequence[tuple[int, int]], orderbys: Sequence[tuple[int, int]]) Any
This is a complex method. To get going initially, just return None and you will be fine. You should also consider using
BestIndexObject()
instead.Implementing this method reduces the number of rows scanned in your table to satisfy queries, but only if you have an index or index like mechanism available.
Note
The implementation of this method differs slightly from the SQLite documentation for the C API. You are not passed “unusable” constraints. The argv/constraintarg positions are not off by one. In the C api, you have to return position 1 to get something passed to
VTCursor.Filter()
in position 0. With the APSW implementation, you return position 0 to get Filter arg 0, position 1 to get Filter arg 1 etc.The purpose of this method is to ask if you have the ability to determine if a row meets certain constraints that doesn’t involve visiting every row. An example constraint is
price > 74.99
. In a traditional SQL database, queries with constraints can be speeded up with indices. If you return None, then SQLite will visit every row in your table and evaluate the constraints itself. Your index choice returned from BestIndex will also be passed to theFilter()
method on your cursor object. Note that SQLite may call this method multiple times trying to find the most efficient way of answering a complex query.constraints
You will be passed the constraints as a sequence of tuples containing two items. The first item is the column number and the second item is the operation.
Example query:
select * from foo where price > 74.99 and quantity<=10 and customer='Acme Widgets'
If customer is column 0, price column 2 and quantity column 5 then the constraints will be:
(2, apsw.SQLITE_INDEX_CONSTRAINT_GT), (5, apsw.SQLITE_INDEX_CONSTRAINT_LE), (0, apsw.SQLITE_INDEX_CONSTRAINT_EQ)
Note that you do not get the value of the constraint (ie “Acme Widgets”, 74.99 and 10 in this example).
If you do have any suitable indices then you return a sequence the same length as constraints with the members mapping to the constraints in order. Each can be one of None, an integer or a tuple of an integer and a boolean. Conceptually SQLite is giving you a list of constraints and you are returning a list of the same length describing how you could satisfy each one.
Each list item returned corresponding to a constraint is one of:
- None
This means you have no index for that constraint. SQLite will have to iterate over every row for it.
- integer
This is the argument number for the constraintargs being passed into the
Filter()
function of yourcursor
(the values “Acme Widgets”, 74.99 and 10 in the example).- (integer, boolean)
By default SQLite will check what you return. For example if you said that you had an index on price and so would only return rows greater than 74.99, then SQLite will still check that each row you returned is greater than 74.99. If the boolean is True then SQLite will not double check, while False retains the default double checking.
Example query:
select * from foo where price > 74.99 and quantity<=10 and customer=='Acme Widgets'
. customer is column 0, price column 2 and quantity column 5. You can index on customer equality and price.Constraints (in)
Constraints used (out)
(2, apsw.SQLITE_INDEX_CONSTRAINT_GT), (5, apsw.SQLITE_INDEX_CONSTRAINT_LE), (0, apsw.SQLITE_INDEX_CONSTRAINT_EQ)
1, None, 0
When your
Filter
method in the cursor is called, constraintarg[0] will be “Acme Widgets” (customer constraint value) and constraintarg[1] will be 74.99 (price constraint value). You can also return an index number (integer) and index string to use (SQLite attaches no significance to these values - they are passed as is to yourVTCursor.Filter()
method as a way for the BestIndex method to let theFilter()
method know which of your indices or similar mechanism to use.orderbys
The second argument to BestIndex is a sequence of orderbys because the query requested the results in a certain order. If your data is already in that order then SQLite can give the results back as is. If not, then SQLite will have to sort the results first.
Example query:
select * from foo order by price desc, quantity asc
Price is column 2, quantity column 5 so orderbys will be:
(2, True), # True means descending, False is ascending (5, False)
Return
You should return up to 5 items. Items not present in the return have a default value.
- 0: constraints used (default None)
This must either be None or a sequence the same length as constraints passed in. Each item should be as specified above saying if that constraint is used, and if so which constraintarg to make the value be in your
VTCursor.Filter()
function.- 1: index number (default zero)
This value is passed as is to
VTCursor.Filter()
- 2: index string (default None)
This value is passed as is to
VTCursor.Filter()
- 3: orderby consumed (default False)
Return True if your output will be in exactly the same order as the orderbys passed in
- 4: estimated cost (default a huge number)
Approximately how many disk operations are needed to provide the results. SQLite uses the cost to optimise queries. For example if the query includes A or B and A has 2,000 operations and B has 100 then it is best to evaluate B before A.
A complete example
Query is
select * from foo where price>74.99 and quantity<=10 and customer=="Acme Widgets" order by price desc, quantity asc
. Customer is column 0, price column 2 and quantity column 5. You can index on customer equality and price.BestIndex(constraints, orderbys) constraints= ( (2, apsw.SQLITE_INDEX_CONSTRAINT_GT), (5, apsw.SQLITE_INDEX_CONSTRAINT_LE), (0, apsw.SQLITE_INDEX_CONSTRAINT_EQ) ) orderbys= ( (2, True), (5, False) ) # You return ( (1, None, 0), # constraints used 27, # index number "idx_pr_cust", # index name False, # results are not in orderbys order 1000 # about 1000 disk operations to access index ) # Your Cursor.Filter method will be called with: 27, # index number you returned "idx_pr_cust", # index name you returned "Acme Widgets", # constraintarg[0] - customer 74.99 # constraintarg[1] - price
- VTTable.BestIndexObject(index_info: IndexInfo) bool
This method is called instead of
BestIndex()
if use_bestindex_object was True in the call toConnection.create_module()
.Use the
IndexInfo
to tell SQLite about your indexes, and extract other information.Return True to indicate all is well. If you return False or there is an error, then SQLITE_CONSTRAINT is returned to SQLite.
- VTTable.Commit() None
This function is used as part of transactions. You do not have to provide the method.
- VTTable.Destroy() None
The opposite of
VTModule.Create()
. This method is called when the table is no longer used. Note that you must always release resources even if you intend to return an error, as it will not be called again on error.
- VTTable.Disconnect() None
The opposite of
VTModule.Connect()
. This method is called when a reference to a virtual table is no longer used, butVTTable.Destroy()
will be called when the table is no longer used.
- VTTable.FindFunction(name: str, nargs: int) None | Callable | tuple[int, Callable]
Called to find if the virtual table has its own implementation of a particular scalar function. You do not have to provide this method.
- Parameters:
name – The function name
nargs – How many arguments the function takes
Return None if you don’t have the function. Zero is then returned to SQLite.
Return a callable if you have one. One is then returned to SQLite with the function.
Return a sequence of int, callable. The int is returned to SQLite with the function. This is useful for SQLITE_INDEX_CONSTRAINT_FUNCTION returns.
It isn’t possible to tell SQLite about exceptions in this function, so an unraisable exception is used.
See also
- VTTable.Integrity(schema: str, name: str, is_quick: int) str | None
If present, check the integrity of the virtual table.
- Parameters:
schema – Database name main, temp, the name in ATTACH
name – Name of the table
is_quick – 0 if pragma integrity_check was used, 1 if pragma quick_check was used, and may contain other values in the future.
- Returns:
None if there are no problems, else a string to be used as an error message. The string is returned to the pragma as is, so it is recommended that you include the database and table name to clarify what database and table the message is referring to.
- VTTable.Release(level: int) None
Release nested transactions back to level.
If you do not provide this method then the call succeeds (matching SQLite behaviour when no callback is provided).
- VTTable.Rename(newname: str) None
Notification that the table will be given a new name. If you return without raising an exception, then SQLite renames the table (you don’t have to do anything). If you raise an exception then the renaming is prevented. You do not have to provide this method.
- VTTable.Rollback() None
This function is used as part of transactions. You do not have to provide the method.
- VTTable.Savepoint(level: int) None
Set nested transaction to level.
If you do not provide this method then the call succeeds (matching SQLite behaviour when no callback is provided).
- VTTable.Sync() None
This function is used as part of transactions. You do not have to provide the method.
- VTTable.UpdateChangeRow(row: int, newrowid: int, fields: tuple[SQLiteValue, ...]) None
Change an existing row. You may also need to change the rowid - for example if the query was
UPDATE table SET rowid=rowid+100 WHERE ...
- Parameters:
row – The existing 64 bit integer rowid
newrowid – If not the same as row then also change the rowid to this.
fields – A tuple of values the same length and order as columns in your table
- VTTable.UpdateDeleteRow(rowid: int) None
Delete the row with the specified rowid.
- Parameters:
rowid – 64 bit integer
- VTTable.UpdateInsertRow(rowid: int | None, fields: tuple[SQLiteValue, ...]) int | None
Insert a row with the specified rowid.
- Parameters:
rowid – None if you should choose the rowid yourself, else a 64 bit integer
fields – A tuple of values the same length and order as columns in your table
- Returns:
If rowid was None then return the id you assigned to the row. If rowid was not None then the return value is ignored.
VTCursor class
- class apsw.VTCursor
Note
There is no actual VTCursor class - it is shown this way for documentation convenience and is present as a typing protocol.
The VTCursor
object is used for iterating over a table.
There may be many cursors simultaneously so each one needs to keep
track of where in the table it is.
- VTCursor.Close() None
This is the destructor for the cursor. Note that you must cleanup. The method will not be called again if you raise an exception.
- VTCursor.Column(number: int) SQLiteValue
Requests the value of the specified column number of the current row. If number is -1 then return the rowid.
- Returns:
Must be one one of the 5 supported types
- VTCursor.ColumnNoChange(number: int) SQLiteValue
VTTable.UpdateChangeRow()
is going to be called which includes values for all columns. However this column is not going to be changed in that update.If you return
apsw.no_change
thenVTTable.UpdateChangeRow()
will haveapsw.no_change
for this column. If you return anything else then it will have that value - as thoughVTCursor.Column()
had been called.This method will only be called if use_no_change was True in the call to
Connection.create_module()
.Calls: sqlite3_vtab_nochange
- VTCursor.Eof() bool
Called to ask if we are at the end of the table. It is called after each call to Filter and Next.
- Returns:
False if the cursor is at a valid row of data, else True
Note
This method can only return True or False to SQLite. If you have an exception in the method or provide a non-boolean return then True (no more data) will be returned to SQLite.
- VTCursor.Filter(indexnum: int, indexname: str, constraintargs: tuple | None) None
This method is always called first to initialize an iteration to the first row of the table. The arguments come from the
BestIndex()
orBestIndexObject()
with constraintargs being a tuple of the constraints you requested. If you always return None in BestIndex then indexnum will be zero, indexstring will be None and constraintargs will be empty).If you had an in constraint and set
IndexInfo.set_aConstraintUsage_in()
then that value will be aset
.
- VTCursor.Next() None
Move the cursor to the next row. Do not have an exception if there is no next row. Instead return False when
Eof()
is subsequently called.If you said you had indices in your
VTTable.BestIndex()
return, and they were selected for use as provided in the parameters toFilter()
then you should move to the next appropriate indexed and constrained row.