Search SDK
Search SDK Overview
The search sdk may be used to access any functionality from the Search API. In order to use the search sdk, the hubmap_sdk package will either need imported with
import hubmap_sdk
or the search sdk class maybe be directly imported with
from hubmap_sdk.searchsdk import SearchSdk
In order to use any of the search sdk methods, create an instance of the search sdk class. This class accepts 2 arguments, both of which are optional. The first is “token”. Token is a Globus Groups token. If there is no such token, only public accessible methods will be reachable. Methods that require special access will be indicated below.
Depending on whether a token is given or, if one is given, which user groups the provided token belongs in, some or all functionality in certain methods will remain inaccessible. These too will be detailed in each method’s outline below. Validation of Globus tokens is performed when the search api is called. If, for example, a method that requires a token is used but there is no token given, the search api will return an error detailing the problem. This will also happen if a token is provided that is invalid, or if the token given is valid but is not part of the necessary user group to access part or all of a method.
If no token is given, it is assumed that there is no token.
The next argument is service_url. The Search Api has several servers: Dev, Test, Stage, and Production. It is also possible to run the search api locally. service_url is where the chosen instance of Search Api is selected. If none is provided, the production server will be used automatically. Be certain there are no typos in the service url. The urls for the different servers are as follows:
- Dev: https://search-api.dev.hubmapconsortium.org/
- Test: https://search-api.test.hubmapconsortium.org/
- Stage: https://search-api.stage.hubmapconsortium.org/
- Production: https://search.api.hubmapconsortium.org/
If using a local instance of Search Api, by default the port used is 5005. Therefore “localhost:5005” would be used for service_url.
Once this search sdk instance is created, it will be used for each method. Creating a search sdk instance will look like the following:
from hubmap_sdk import SearchSdk
#In this example, the token and service url are being retrieved from a configuration file.
url = app.config['DEV_URL']
token = app.config['GLOBUS_TOKEN']
search_instance = SearchSdk(token, url)
Search SDK Methods
The following search sdk methods each correspond with an endpoint inside the search api.
- Indices
- Search
- Search by Index
- Count
- Count by Index
- Status
- Reindex
- Reindex All
- Assay Type
- Assay Name
Indices
Description | Indices will return a list of supported indices. |
Arguments | None |
Outputs | This method outputs a list of strings. Each string is a supported index |
Error Handling | If the response code from Search Api is greater than 299, an exception will be raised. The exception message will be the response from the Search API. The indices method will return this response from the API, so if this exception is handled individually, this information can be used. If the request to the Search Api fails, an exception will be raise as well |
Authorizations | This method requires no token |
Example:
list_of_indices = searchsdk_instance.indices()
Search
Description | Executes a search with body against ElasticSearch REST API without specifying an index. Takes an ElasticSearch query in the form of a dictionary | |
Arguments | data | Optional: No |
Type: Dictionary | ||
Outputs | This method outputs the results of the search query as a dictionary | |
Error Handling | If the response code from Search Api is greater than 299, an exception will be raised. The exception message will be the response from the Search API. The search method will return this response from the API, so if this exception is handled individually, this information can be used. If the request to the Search Api fails, an exception will be raise as well | |
Authorization | No token is required for this method. If a token is provided and it is invalid, an exception will be raised. If a token is not provided, or if a valid token is provided and the token is not part of HuBMAP-Read group, only information for entities that are public will be returned |
Example:
searchbody = {
"query": {
"match": {
"uuid": "99999999999999999999999999999999"
}
}
}
search_result = searchsdk_instance.search(searchbody)
Search by Index
Description | Executes a search with body against ElasticSearch REST API with a specified index. Takes an ElasticSearch query in the form of a dictionary and an index. Note: the index in URL is not he real index in Elasticsearch, it's that index without prefix | |
Arguments | data | Optional: No |
Type: Dictionary | ||
index | Optional: No | |
Type: String | ||
Outputs | This method outputs the results of the search query as a dictionary | |
Error Handling | If the response code from Search Api is greater than 299, an exception will be raised. The exception message will be the response from the Search API. The search by index method will return this response from the API, so if this exception is handled individually, this information can be used. If the request to the Search Api fails, an exception will be raise as well | |
Authorization | No token is required for this method. If a token is provided and it is invalid, an exception will be raised. If a token is not provided, or if a valid token is provided and the token is not part of HuBMAP-Read group, the entities index will be used by default |
Example:
searchbody = {
"query": {
"match": {
"uuid": "99999999999999999999999999999999"
}
}
}
search_result = searchsdk_instance.search_by_index(searchbody, "entities")
Count
Description | Executes a 'count' query with body against ElasticSearch REST API without specifying an index. Takes an ElasticSearch query in the form of a dictionary | |
Arguments | data | Optional: No |
Type: Dictionary | ||
Outputs | This method outputs the results of the count query as a dictionary | |
Error Handling | If the response code from Search Api is greater than 299, an exception will be raised. The exception message will be the response from the Search API. The count method will return this response from the API, so if this exception is handled individually, this information can be used. If the request to the Search Api fails, an exception will be raise as well | |
Authorization | No token is required for this method. If a token is provided and it is invalid, an exception will be raised. If a token is not provided, or if a valid token is provided and the token is not part of HuBMAP-Read group, only information for entities that are public will be returned |
Example:
searchbody = {
"query": {
"match": {
"uuid": "99999999999999999999999999999999"
}
}
}
count_result = searchsdk_instance.count(searchbody)
Count by Index
Description | Executes a 'count' query with body against ElasticSearch REST API with a specified index. Takes an ElasticSearch query in the form of a dictionary and an index. Note: the index in URL is not he real index in Elasticsearch, it's that index without prefix | |
Arguments | data | Optional: No |
Type: Dictionary | ||
index | Optional: No | |
Type: String | ||
Outputs | This method outputs the results of the count query as a dictionary | |
Error Handling | If the response code from Search Api is greater than 299, an exception will be raised. The exception message will be the response from the Search API. The count by index method will return this response from the API, so if this exception is handled individually, this information can be used. If the request to the Search Api fails, an exception will be raise as well | |
Authorization | No token is required for this method. If a token is provided and it is invalid, an exception will be raised. If a token is not provided, or if a valid token is provided and the token is not part of HuBMAP-Read group, the entities index will be used by default |
Example:
searchbody = {
"query": {
"match": {
"uuid": "99999999999999999999999999999999"
}
}
}
count_result = searchsdk_instance.count_by_index(searchbody, "entities")
Status
Description | Get status will print and return the current build, elasticsearch connection, elasticsearch status, and version of the Search Api. |
Arguments | None |
Outputs | This method will print the version, build, and neo4j connection status to the terminal. It will be formatted as "'build': '{build}', 'elasticsearch_connection': '{elasticsearch_connection}', 'elasticsearch_status': '{elasticsearch_status}', 'version': '{version}'". Additionally, the complete response from search api will be returned as a dictionary. |
Error Handling | Most methods will raise an exception if either Search Api returns an error code (http status code 300 or greater) or if the connection fails altogether. Because of the nature of the status method, if either of these occur, an exception will not be raised, rather the exception or error message from Search Api will be printed to the terminal and returned. |
Authorizations | This method requires no token |
Example:
status_object = statussdk_instance.get_status()
Output:
'build: '{build}' , elasticsearch_connection: '{elasticsearch_connection}', elasticsearch_status: '{elasticsearch_status}', version: '{version}'
Reindex
Description | Performs a reindex in elasticsearch at a given uuid | |
Arguments | uuid | Optional: No |
Type: String | ||
Outputs | This method outputs a tuple that includes a message that a reindex request has been accepted by elasticsearch for the given uuid in the form of a string, as well as the status code integer returned. The response string will be: "Request of reindexing {uuid} accepted". A successful reindex request will have a status code of 202. | |
Error Handling | If the response code from Search Api is greater than 299, an exception will be raised. The exception message will be the response from the Search API. The reindex method will return this response from the API, so if this exception is handled individually, this information can be used. If the request to the Search Api fails, an exception will be raise as well | |
Authorization | A token is required for this method, however HuBMAP Data Admin group is not required for reindexing of individual uuid's. |
Example:
uuid = "99999999999999999999999999999999"
reindex_result = searchsdk_instance.reindex(uuid)
Reindex All
Description | Performs a live reindex in elasticsearch for all documents without first deleting and recreating the indices. This just deletes the old document and add the latest document of each entity (if still available) |
Arguments | None |
Outputs | This method outputs a tuple that includes a message that a reindex all request has been accepted by elasticsearch in the form of a string, as well as the status code integer returned. The response string will be: "Request of live reindex all documents accepted". A successful reindex all request will have a status code of 202. |
Error Handling | If the response code from Search Api is greater than 299, an exception will be raised. The exception message will be the response from the Search API. The reindex all method will return this response from the API, so if this exception is handled individually, this information can be used. If the request to the Search Api fails, an exception will be raise as well |
Authorizations | This method requires a token. Token must belong to HuBMAP Data-Admin group access to be able to trigger a live reindex |
Example:
reindex_result = searchsdk_instance.reindex_all()
Assay Type
Description | Returns all the dataset data or assay types information (code, description, etc.) defined from the Assay Types Yaml | |
Arguments | key | Optional: No |
Type: String | ||
Outputs | This method outputs a dictionary with the assay type information. | |
Error Handling | If the response code from Search Api is greater than 299, an exception will be raised. The exception message will be the response from the Search API. The assay type method will return this response from the API, so if this exception is handled individually, this information can be used. If the request to the Search Api fails, an exception will be raise as well | |
Authorization | No token is required for this method. If a token is provided, however, it must be valid. |
Example:
key = 'simple'
assaytype_result = searchsdk_instance.assaytype(key)
Assay Name
Description | Returns the dataset data or assay type information of a given assay name | |
Arguments | name | Optional: No |
Type: String | ||
Outputs | This method outputs a dictionary with the information for the given assay name. | |
Error Handling | If the response code from Search Api is greater than 299, an exception will be raised. The exception message will be the response from the Search API. The assay name method will return this response from the API, so if this exception is handled individually, this information can be used. If the request to the Search Api fails, an exception will be raise as well | |
Authorization | No token is required for this method. If a token is provided, however, it must be valid. |
Example:
name = 'AF_pyramid'
assayname_result = searchsdk_instance.assayname(name)