Amazon Simple Storage Service (Amazon S3) provides simple, cost-effective storage at large scale
For a decade, Amazon has offered S3 as a web service, providing object storage with a simple API to store and retrieve any amount of data from anywhere on the web. S3 is used as a bulk repository, or "data lake," for analytics; for backup & recovery; for disaster recovery; and for archiving. Many cloud-based applications use S3 as primary storage.
S3 is sometimes referred to as “Amazon Buckets” because in order to upload your data (photos, videos, documents etc.), you first create a bucket in one of the AWS Regions. You can then upload any number of objects to the bucket.
The BAI Amazon S3 connector makes it possible to surface content from S3 using SharePoint Search or Azure Search, enabling users to get integrated search results that include files stored in S3. This eliminates the need to do multiple searches, which can often cause inconsistent results and decreased productivity.
A single consolidated search index referencing S3, along with content from other repositories, is surfaced as a single unified result set, giving users a single access point for searching across the enterprise.
The Amazon S3 Connector works with all content stored in S3. In addition to Amazon S3, we provide over 70 other connectors to enable enterprise-wide content to be searched . The list can be found here.
The Amazon S3 Connector:
Works with all content in S3.
S3 retains the master information and the search index has pointers to the original content so that users clicking on a search result are always working on the most current documents.
Respects the access permissions (security) set in Amazon S3. This means that the users who have sharing/access privileges assigned to them in A3 are the only ones who can see the results. Resource-based policies, specifically access control lists (ACLs), are crawled along with the objects.
In addition to supporting full crawls, the connector supports incremental crawls, which extract only content which has changed since the last crawl.
Indexes with a very high throughput.