Indexes: Map Indexes
Map
indexes, sometimes referred to as simple indexes, contain one or more mapping functions to indicate which document fields should be indexed.- After indexing, documents can be searched by the indexed fields.
-
The mapping functions can be considered the core of indexes.
-
In This Page:
Also see:
- Indexing fields from related documents
- Aggregating data with Map-Reduce indexes
- Indexing multiple collections with Multi-Map indexes
- Running calculations and storing the results in the index to reduce query time
Indexing single fields
Let's create an index that will help us search for Employees
by their FirstName
, LastName
, or both.
- First, let's create an index called
Employees/ByFirstAndLastName
Note: The naming separator character "
_
" in your code will become "/" in the index name.
In the following sample, "Employees_ByFirstAndLastName
" will become "Employees/ByFirstAndLastName" in your indexes list.
class Employees_ByFirstAndLastName extends AbstractIndexCreationTask
{
// ...
}
class Employees_ByFirstAndLastName extends AbstractJavaScriptIndexCreationTask
{
// ...
}
You might notice that we're passing Employee
as a generic parameter to AbstractIndexCreationTask
.
This gives our indexing function a strongly-typed syntax. If you are not familiar with AbstractIndexCreationTask
,
you can read this article before proceeding.
- The next step is to create the indexing function itself. This is done by setting the
map
property with our function in a parameterless constructor.
public function __construct()
{
parent::__construct();
$this->map = "docs.Employees.Select(employee => new { " .
" FirstName = employee.FirstName, " .
" LastName = employee.LastName " .
"})";
}
public function __construct()
{
parent::__construct();
$this->setMaps([
"map('Employees', function (employee){
return {
FirstName : employee.FirstName,
LastName : employee.LastName
};
})"
]);
}
- The final step is to deploy it to the server
and issue a query using the session Query method.
To query an index, the name of the index must be called by the query.
If the index isn't called, RavenDB will either use or create an auto index.
$employees1 = $session
->query(Employee::class, Employees_ByFirstAndLastName::class)
->whereEquals('FirstName', "Robert")
->toList();
$employees2 = $session
->query("Employees/ByFirstAndLastName")
->whereEquals('FirstName', "Robert")
->toList();
from index 'Employees/ByFirstAndLastName'
where FirstName = 'Robert'
This is how our final index looks like:
_1
$employees = $session
->query(Employees_ByYearOfBirth_Result::class, Employees_ByYearOfBirth::class)
->whereEquals("YearOfBirth", 1963)
->ofType(Employee::class)
->toList();
class Employees_ByFirstAndLastName extends AbstractJavaScriptIndexCreationTask
{
public function __construct()
{
parent::__construct();
$this->setMaps(["map('Employees', function (employee){
return {
FirstName : employee.FirstName,
LastName : employee.LastName
};
})"]);
}
}
Field Types
Please note that indexing capabilities are detected automatically from the returned field type from the indexing function.
For example, if our Employee
has a property named Age
that is an int
, the following indexing function...
from employee in docs.Employees
select new
{
Age = employee.Age
}
map('Employees', function(employee)
{
return {
Age : employee.Age
};
})
...grants us the capability to issue numeric queries (return all the Employees whose Age is more than 30).
Changing the Age
type to a string
will take that capability away from you. The easiest example would be to issue .ToString()
on the Age
field...
from employee in docs.Employees
select new
{
Age = employee.Age.ToString()
}
map('Employees', function(employee)
{
return {
Age : employee.Age.toString()
};
})
Convention
You will probably notice that in the Studio
, this function is a bit different from the one defined in the Employees_ByFirstAndLastName
class:
from employee in docs.Employees
select new
{
FirstName = employee.FirstName,
LastName = employee.LastName
}
The part you should pay attention to is docs.Employees
. This syntax indicates from which collection a server should take the documents for indexing. In our case, documents will be taken from the Employees
collection. To change the collection, you need to change Employees
to the desired collection name or remove it and leave only docs
to index all documents.
Combining multiple fields
Since each index contains a function, you can combine multiple fields into one.
Example
Index definition:
class Employees_ByFullName_Result {
private ?string $fullName = null;
public function getFullName(): ?string
{
return $this->fullName;
}
public function setFullName(?string $fullName): void
{
$this->fullName = $fullName;
}
}
class Employees_ByFullName extends AbstractIndexCreationTask
{
public function __construct() {
parent::__construct();
$this->map = "docs.Employees.Select(employee => new { " .
" FullName = (employee.FirstName + \" \") + employee.LastName " .
"})";
}
}
class Employees_ByFullName_Result
{
private ?string $fullName = null;
public function getFullName(): ?string
{
return $this->fullName;
}
public function setFullName(?string $fullName): void
{
$this->fullName = $fullName;
}
}
class Employees_ByFullName extends AbstractJavaScriptIndexCreationTask
{
public function __construct()
{
parent::__construct();
$this->setMaps(["map('Employees', function (employee){
return {
FullName : employee.FirstName + ' ' + employee.LastName
};
})"]);
}
}
Query the index:
// notice that we're 'cheating' here
// by marking result type in 'Query' as 'Employees_ByFullName.Result' to get strongly-typed syntax
// and changing type using 'OfType' before sending query to server
$employees = $session
->query(Employees_ByFullName_Result::class, Employees_ByFullName::class)
->whereEquals('FullName', "Robert King")
->ofType(Employee::class)
->toList();
from index 'Employees/ByFullName'
where FullName = 'Robert King'
Indexing partial field data
Imagine that you would like to return all employees that were born in a specific year.
You can do it by indexing Birthday
from Employee
, then specify the year in Birthday
as you query the index:
Index definition:
class Employees_ByBirthday_Result
{
public ?DateTime $birthday = null;
public function getBirthday(): ?DateTime
{
return $this->birthday;
}
public function setBirthday(?DateTime $birthday): void
{
$this->birthday = $birthday;
}
}
class Employees_ByBirthday extends AbstractIndexCreationTask
{
public function __construct()
{
parent::__construct();
$this->map = "docs.Employees.Select(employee => new { " .
" Birthday = employee.Birthday " .
"})";
}
}
class Employees_ByBirthday_Result
{
private ?DateTime $birthday = null;
public function getBirthday(): ?DateTime
{
return $this->birthday;
}
public function setBirthday(?DateTime $birthday): void
{
$this->birthday = $birthday;
}
}
class Employees_ByBirthday extends AbstractJavaScriptIndexCreationTask
{
public function __construct()
{
parent::__construct();
$this->setMaps([
"map('Employees', function (employee){
return {
Birthday : employee.Birthday
}
})"
]);
}
}
Query the index:
$startDate = new DateTime('1963-01-01');
$endDate = $startDate->modify('+1 year')->sub(new DateInterval('PT0.001S'));
$employees = $session
->query(Employees_ByBirthday_Result::class, Employees_ByBirthday::class)
->whereGreaterThanOrEqual("Birthday", $startDate)
->andAlso()
->whereLessThanOrEqual("Birthday", $endDate)
->ofType(Employee::class)
->toList();
from index 'Employees/ByBirthday '
where Birthday between '1963-01-01' and '1963-12-31T23:59:59.9990000'
RavenDB gives you the ability to extract field data and to index by it. A different way to achieve our goal will look as follows:
Index definition:
class Employees_ByYearOfBirth_Result {
public ?int $yearOfBirth = null;
public function getYearOfBirth(): ?int
{
return $this->yearOfBirth;
}
public function setYearOfBirth(?int $yearOfBirth): void
{
$this->yearOfBirth = $yearOfBirth;
}
}
class Employees_ByYearOfBirth extends AbstractIndexCreationTask {
public function __construct() {
parent::__construct();
$this->map = "docs.Employees.Select(employee => new { " .
" YearOfBirth = employee.Birthday.Year " .
"})";
}
}
class Employees_ByYearOfBirth_Reslut
{
private ?int $yearOfBirth = null;
public function getYearOfBirth(): ?int
{
return $this->yearOfBirth;
}
public function setYearOfBirth(?int $yearOfBirth): void
{
$this->yearOfBirth = $yearOfBirth;
}
}
class Employees_ByYearOfBirth extends AbstractJavaScriptIndexCreationTask
{
public function __construct()
{
parent::__construct();
$this->setMaps([
"map('Employees', function (employee){
return {
Birthday : employee.Birthday.Year
}
})"
]);
}
}
Query the index:
$employees = $session
->query(Employees_ByYearOfBirth_Result::class, Employees_ByYearOfBirth::class)
->whereEquals("YearOfBirth", 1963)
->ofType(Employee::class)
->toList();
from index 'Employees/ByYearOfBirth'
where YearOfBirth = 1963
Filtering data within fields
In the examples above, where_equals
is used in the query to filter the results.
If you consistently want to filter with the same filtering conditions,
you can use where_equals
in the index definition to narrow the index terms that the query must scan.
This can save query-time but narrows the terms available to query.
Example I
For logic that has to do with special import rules that only apply to the USA
where
can be used to filter the Companies collection Address.Country
field.
Thus, we only index documents where company.Address.Country == "USA"
.
Index definition:
class Employees_Query_Result {
public ?StringArray $query = null;
public function getQuery(): ?StringArray
{
return $this->query;
}
public function setQuery(?StringArray $query): void
{
$this->query = $query;
}
}
class Employees_Query extends AbstractIndexCreationTask {
public function __construct() {
parent::__construct();
$this->map = "docs.Employees.Select(employee => new { " .
" Query = new [] { employee.FirstName, employee.LastName, employee.Title, employee.Address.City } " .
"})";
$this->index("query", FieldIndexing::search());
}
}
Query the index:
$orders = $session
->query(Companies_ByAddress_Country_Result::class, Companies_ByAddress_Country::class)
->ofType(Company::class)
->toList();
from index 'Companies_ByAddress_Country'
Example II
Imagine a seed company that needs to categorize its customers by latitude-based growing zones.
They can create a different index for each zone and filter their customers in the index with
where (company.Address.Location.Latitude > 20 && company.Address.Location.Latitude < 50)
.
Index definition:
class Companies_ByAddress_Latitude_Result {
private ?float $latitude = null;
private ?float $longitude = null;
private ?string $companyName = null;
private ?string $companyAddress = null;
private ?string $companyPhone = null;
public function getLatitude(): ?float
{
return $this->latitude;
}
public function setLatitude(?float $latitude): void
{
$this->latitude = $latitude;
}
public function getLongitude(): ?float
{
return $this->longitude;
}
public function setLongitude(?float $longitude): void
{
$this->longitude = $longitude;
}
public function getCompanyName(): ?string
{
return $this->companyName;
}
public function setCompanyName(?string $companyName): void
{
$this->companyName = $companyName;
}
public function getCompanyAddress(): ?string
{
return $this->companyAddress;
}
public function setCompanyAddress(?string $companyAddress): void
{
$this->companyAddress = $companyAddress;
}
public function getCompanyPhone(): ?string
{
return $this->companyPhone;
}
public function setCompanyPhone(?string $companyPhone): void
{
$this->companyPhone = $companyPhone;
}
}
Query the index:
$orders = $session
->query(Companies_ByAddress_Latitude_Result::class, Companies_ByAddress_Latitude::class)
->ofType(Company::class)
->toList();
from index 'Companies_ByAddress_Latitude'
Indexing nested data
If your document contains nested data, e.g. Employee
contains Address
, you can index on its fields by accessing them directly in the index. Let's say that we would like to create an index that returns all employees that were born in a specific Country
:
Index definition:
class Employees_ByCountry_Result
{
private ?string $country = null;
public function getCountry(): ?string
{
return $this->country;
}
public function setCountry(?string $country): void
{
$this->country = $country;
}
}
class Employees_ByCountry extends AbstractIndexCreationTask {
public function __construct() {
parent::__construct();
$this->map = "docs.Employees.Select(employee => new { " .
" Country = employee.Address.Country " .
"})";
}
}
class Employees_ByCountry_Result
{
private ?string $country = null;
}
class Employees_ByCountry extends AbstractJavaScriptIndexCreationTask
{
public function __construct()
{
parent::__construct();
$this->setMaps([
"map('Employees', function (employee){
return {
Country : employee.Address.Country
}
})"
]);
}
}
Query the index:
$employees = $session
->query(Employees_ByCountry_Result::class, Employees_ByCountry::class)
->whereEquals("Country", "USA")
->ofType(Employee::class)
->toList();
from index 'Employees/ByCountry'
where Country = 'USA'
If a document relationship is represented by the document's ID, you can use the LoadDocument
method to retrieve such a document.
Learn more here.
Indexing Missing Fields
By default, indexes will not index a document that contains none of the specified fields. This behavior can be changed using the Indexing.IndexEmptyEntries configuration option.
The option Indexing.IndexMissingFieldsAsNull
determines whether missing fields in documents are indexed with the value null
, or not indexed at all.