Self-Hosted Data
This feature is available to Enterprise-tier customers only.
If you're interested in using Studio's Self-Hosted Data capabilities, please contact the team at [email protected].
This feature is only available for certain data types.
Currently, Studio's Self-Hosted Data capabilities are available for image annotation, document transcription, and certain categorization task types. If you are interested in support for other task types, please contact the team at [email protected].
Typically, when customers upload attachments to Scale Studio, the platform processes the asset for optimization and quality purposes (for example, videos are processed for faster rendering). Self-Hosted Data allows customers to use Scale Studio without their data passing through Scaleโs servers by using private URLs.
Private URLs
Private URLs are URLs only certain people are able to access. For example, there are ways to configure URL permissions so that only employees on a VPN network are able download and view content. This can be done by configuring cloud storage bucket settings, such as S3.
Example:
{
"Version": "2012-10-17",
"Id": "S3PolicyId1",
"Statement": [
{
"Sid": "IPAllow",
"Effect": "Allow",
"Principal": "*",
"Action": "s3:*",
"Resource": [
"arn:aws:s3:::BUCKET_NAME",
"arn:aws:s3:::BUCKET_NAME/*"
],
"Condition": {
"IpAddress": {
"aws:SourceIp": "YOUR_VPN_IP_ADDRESS"
}
}
}
]
}
You can verify that these URLs can only be opened on your VPN connection. These URLs will look like https://{BUCKET_NAME}.s3.us-{REGION}.amazonaws.com/{FILENAME}
.
After generating a list of URLs, you can upload these via CSV using the attachment_url
column name.
If images are not showing up on your private network and/or you notice CORS errors in the network tab, you may need to set CORS headers.
Example CORS configuration:
[
{
"AllowedHeaders": [
"*"
],
"AllowedMethods": [
"GET",
"HEAD"
],
"AllowedOrigins": [
"*"
],
"ExposeHeaders": []
}
]
To learn more and utilize this feature, please reach out to [email protected].
Updated 11 months ago