Full-Text Search


  • This article is about running a full-text search with a dynamic query.
    To learn how to run a full-text search using a static-index, see full-text search with index.

  • Use the search() method to query for documents that contain specified term/s within the text of the specified document field/s.

  • When running a full-text search with a dynamic query, the auto-index created by the server breaks down the text of the searched document field using the default search analyzer.
    All generated terms are lower-cased, so the search is case-insensitive.

  • Gain additional control over term tokenization by running a full-text search using a static-index, where the used analyzer is configurable.

  • A boost value can be set for each search to prioritize results. Learn more in boost search results.

  • User experience can be enhanced by requesting text fragments that highlight the searched terms in the results. Learn more in highlight search results.



Search for single term

employees = list(
    session
    # Make a dynamic query on Employees collection
    .query(object_type=Employee)
    # * Call 'Search' to make a Full-Text search
    # * Search is case-insensitive
    # * Look for documents containing the term 'University' within their 'Notes' field
    .search("Notes", "University")
)
# Results will contain Employee documents that have
# any case variation of the term 'university' in their 'Notes' field.
from "Employees"
where search(Notes, "University")

  • Executing the above query will generate the auto-index Auto/Employees/BySearch(Notes).

  • This auto-index will contain the following two index-fields:

    • Notes
      Contains terms with the original text from the indexed document field 'Notes'.
      Text is lower-cased and Not tokenized.

    • search(Notes)
      Contains lower-cased terms that were tokenized from the 'Notes' field by the default search analyzer (RavenStandardAnalyzer). Calling the search() method targets these terms to find matching documents.

Search for multiple terms

  • You can search for multiple terms in the same field in a single search method.

  • By default, the logical operator between these terms is 'OR'.

  • This behavior can be modified. See section Search operators.

Pass terms in a string:

employees = list(
    session.query(object_type=Employee)
    # * Pass multiple terms in a single string, separated by spaces.
    # * Look for documents containing either 'University' OR 'Sales' OR 'Japanese'
    #   within their 'Notes' field
    .search("Notes", "University Sales Japanese")
)

# * Results will contain Employee documents that have at least one of the specified terms.
# * Search is case-insensitive.
from "Employees"
where search(Notes, "University Sales Japanese")

Search in multiple fields

  • You can search for terms in different fields by making multiple search calls.

  • By default, the logical operator between consecutive search methods is 'OR'.

  • This behavior can be modified. See section Search options.

employees = list(
    session.query(object_type=Employee)
    # * Look for documents containing:
    #   'French' in their 'Notes' field OR 'President' in their 'Title' field
    .search("Notes", "French").search("Title", "President")
)
# * Results will contain Employee documents that have
#   at least one of the specified fields with the specified terms.
# * Search is case-insensitive.
from "Employees"
where (search(Notes, "French") or search(Title, "President"))

Search in complex object

  • You can search for terms within a complex object.

  • Any nested text field within the object is searchable.

companies = list(
    session.query(object_type=Company)
    # * Look for documents that contain:
    #   the term 'USA' OR 'London' in any field within the complex 'Address' object
    .search("Address", "USA London")
)
from "Companies"
where search(Address, "USA London")

Search operators

  • By default, the logical operator between multiple terms within the same field in a search call is OR.

  • This can be modified using the @operator parameter as follows:

AND:

employees = list(
    session.query(object_type=Employee)
    # * Pass `@operator` with 'SearchOperator.AND'
    .search("Notes", "College German", operator=SearchOperator.AND)
)
# * Results will contain Employee documents that have BOTH 'College' AND 'German'
#   in their 'Notes' field.
# * Search is case-insensitive.
from "Employees"
where search(Notes, "College German", and)

OR:

employees = list(
    session.query(object_type=Employee)
    # * Pass `@operator` with 'SearchOperator.OR' (or don't pass this param at all)
    .search("Notes", "College German", operator=SearchOperator.OR)
)
# * Results will contain Employee documents that have BOTH 'College' OR 'German'
#   in their 'Notes' field.
# * Search is case-insensitive.
from "Employees"
where search(Notes, "College German")

Search options

Search options allow to:

  • Negate a search criteria.
  • Specify the logical operator used between consecutive search calls.

Negate search:

companies = list(
    session.query(object_type=Company)
    .open_subclause()
    # Call 'Not' to negate the next search call
    .not_()
    .search("Address", "USA")
    .close_subclause()
)
# * Results will contain Company documents are NOT located in 'USA'
# * Search is case-insensitive
from "Companies"
where (exists(Address) and not search(Address, "USA"))

Default behavior between search calls:

  • By default, the logical operator between consecutive search methods is OR.

companies = list(
    session.query(object_type=Company).where_equals("Contact.Title", "Owner")
    # Operator AND will be used with previous 'where_equals' predicate
    .search("Address.Country", "France")
    # Operator OR will be used between the two 'search' calls by default
    .search("Name", "Markets")
)

# * Results will contain Company documents that have:
#   ('Owner' as the 'Contact.Title')
#   AND
#   (are located in 'France' OR have 'Markets' in their 'Name' field)
#
# * Search is case-insensitive
from "Companies" 
where Contact.Title == "Owner" and
(search(Address.Country, "France") or search(Name, "Markets"))

Use options as bit flags:

employees = list(
    session.advanced.document_query(object_type=Employee)
    .search("Notes", "French")
    # Call 'AndAlso' so that operator AND will be used with previous 'Search' call
    .and_also()
    .open_subclause()
    # Call 'Not' to negate the next search call
    .not_()
    .search("Title", "Manager")
    .close_subclause()
)

# * Results will contain Employee documents that have:
#   ('French' in their 'Notes' field)
#   AND
#   (do NOT have 'Manager' in their 'Title' field)
#
# * Search is case-insensitive
from "Employees"
where search(Notes, "French") and
(exists(Title) and not search(Title, "Manager"))

Using wildcards

  • Wildcards can be used to replace:

    • Prefix of a searched term
    • Postfix of a searched term
    • Both prefix & postfix
  • Note:

    • Searching with a wildcard as the prefix of the term (e.g. *text) is not advised as it will cause the server to perform a full index scan.

    • Instead, consider using a static-index that indexes the field in reverse order
      and then query with a wildcard as the postfix, which is much faster.

employees = list(
    session.query(object_type=Employee)
    # Use '*' to replace one or more characters
    .search("Notes", "art*")
    .search("Notes", "*logy")
    .search("Notes", "*mark*")
)

# Results will contain Employee documents that have in their 'Notes' field:
# (terms that start with 'art')  OR
# (terms that end with 'logy') OR
# (terms that have the text 'mark' in the middle)
#
# * Search is case-insensitive
from "Employees" where
search(Notes, "art*") or
search(Notes, "*logy") or
search(Notes, "*mark*")

Syntax

def search(self, field_name: str, search_terms: str, operator: SearchOperator = None) -> DocumentQuery[_T]: ...
Parameter Type Description
field_name str Name of the searched field.
search_terms str A string containing the term or terms (separated by spaces) to search for.
operator SearchOperator Logical operator to use between multiple terms in the same Search method.
Can be: SearchOperator.OR or SearchOperator.AND
Default: SearchOperation.OR
Return Type Description
DocumentQuery[_T] The same object used for the query