Data organization & structure
How is the data organized?
- RBID_PER — Unique person identifier. This follows the person across job changes. When someone moves to a new company, their RBID_PER stays the same.
- RBID_ORG — Unique organization identifier.
- RBID_PAO — Unique person-at-organization identifier. This is essentially RBID_PER combined with RBID_ORG, so it changes when a person moves to a new company.
RBID_PAO, but sharing the same RBID_PER.
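The relationship between the three identifiers can be illustrated with a small grouping sketch. Only the RBID_PER, RBID_ORG, and RBID_PAO field names come from this page; the sample values and the helper name `positions_by_person` are hypothetical and chosen purely for illustration.

```python
from collections import defaultdict

# Hypothetical sample rows; only the three RBID_* field names come from the FAQ.
records = [
    {"RBID_PER": "per-1", "RBID_ORG": "org-A", "RBID_PAO": "pao-1A"},
    {"RBID_PER": "per-1", "RBID_ORG": "org-B", "RBID_PAO": "pao-1B"},
    {"RBID_PER": "per-2", "RBID_ORG": "org-A", "RBID_PAO": "pao-2A"},
]


def positions_by_person(rows):
    """Group person-at-organization IDs under the person ID they belong to."""
    grouped = defaultdict(list)
    for row in rows:
        # RBID_PAO can be missing on some records, so skip empty values.
        if row.get("RBID_PAO"):
            grouped[row["RBID_PER"]].append(row["RBID_PAO"])
    return dict(grouped)


history = positions_by_person(records)
print(history)
```

In this sketch the person `per-1` appears under two different RBID_PAO values, one per organization, while the RBID_PER stays constant.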
Why are RBID_ORG and RBID_PAO sometimes missing?
RBID_ORG.
What are sparse or 'junk' profiles?
Data delivery & access
What data delivery methods are available?
- Snowflake Data Sharing — We share tables directly to your Snowflake account. No data movement required; query in place.
- AWS S3 bucket — We deliver files to an S3 bucket (either one we provision for you, or your own). Access via AWS CLI.
- Gigasheet — Browser-based spreadsheet UI for searching, filtering, and exporting data without code.
How do I access my AWS Account ID?
How do I access my Snowflake Account ID?
What file formats are supported for S3 delivery?
- CSV — Best for spreadsheet tools, SQL imports, and lightweight pipelines
- JSON — Best for programmatic ingestion and nested data structures
- Parquet — Best for high-performance analytics and columnar query engines. Read more in Parquet’s official documentation.
How do I access the Snowflake share?
How do I get data feed credentials?
Will credentials and paths stay the same across deliveries?
- delivery_date — Date of delivery in YYYYMMDD format
- dataset_name — Depends on your subscription (e.g., per, org)
- file_format — One of json, csv, or parquet
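A pipeline that consumes these deliveries may want to sanity-check the three components before using them. The following is a minimal sketch under stated assumptions: the function name `validate_delivery_components` is hypothetical, and it deliberately does not assemble a full path, because the exact path layout is not specified here.

```python
from datetime import datetime

# The three supported formats listed above.
ALLOWED_FORMATS = {"json", "csv", "parquet"}


def validate_delivery_components(delivery_date, dataset_name, file_format):
    """Return the parsed delivery date, or raise ValueError on a bad component."""
    # Require exactly eight digits first, because strptime alone would also
    # accept shorter strings such as "2024115".
    if len(delivery_date) != 8 or not delivery_date.isdigit():
        raise ValueError(f"delivery_date must be YYYYMMDD, got {delivery_date!r}")
    parsed = datetime.strptime(delivery_date, "%Y%m%d").date()
    if not dataset_name:
        raise ValueError("dataset_name must not be empty")
    if file_format not in ALLOWED_FORMATS:
        raise ValueError(f"file_format must be one of {sorted(ALLOWED_FORMATS)}")
    return parsed


delivery = validate_delivery_components("20240115", "per", "parquet")
print(delivery.isoformat())
```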
Data updates & refresh
When is the data refreshed?
What is the verification cadence?
- Profile verification — 95% of profiles are re-verified every 90 days. “Re-verified” means the source profile was accessible and we were able to confirm or update at least the core profile information (name, title, company, location). Check the updated_at field for the most recent verification date on any record.
- Email verification — All email addresses are re-verified every 60 days. Check the email_last_verified_at field. A valid email is a strong signal that the associated work experience is still current.
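The two cadences above can be turned into a simple freshness check on a record. This is a sketch, not a definitive implementation: it assumes the updated_at and email_last_verified_at values are ISO 8601 strings (the actual timestamp format is not stated here), treats timestamps without an offset as UTC, and uses hypothetical helper names.

```python
from datetime import datetime, timedelta, timezone

PROFILE_WINDOW = timedelta(days=90)  # profile re-verification cadence
EMAIL_WINDOW = timedelta(days=60)    # email re-verification cadence


def parse_ts(value):
    """Parse an ISO 8601 timestamp (assumed format) into an aware datetime."""
    ts = datetime.fromisoformat(value)
    if ts.tzinfo is None:
        # Assumption: timestamps without an offset are treated as UTC.
        ts = ts.replace(tzinfo=timezone.utc)
    return ts


def freshness(record, now):
    """Report whether each verification field falls inside its expected window."""
    return {
        "profile_fresh": now - parse_ts(record["updated_at"]) <= PROFILE_WINDOW,
        "email_fresh": now - parse_ts(record["email_last_verified_at"]) <= EMAIL_WINDOW,
    }


now = datetime(2024, 6, 1, tzinfo=timezone.utc)
record = {
    "updated_at": "2024-04-15T00:00:00",              # 47 days before `now`
    "email_last_verified_at": "2024-03-01T00:00:00",  # 92 days before `now`
}
result = freshness(record, now)
print(result)
```

A record that falls outside a window is not necessarily wrong; it only signals that the most recent verification pass could not confirm it.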
Why are some profiles outdated or not recently updated?
updated_at dates for several reasons:
- Source accessibility issues — The profile returned a 404 or was temporarily unreachable during the verification pass. Profiles that remain inaccessible for one year are deprecated and removed from the dataset.
- Platform scraping protections — LinkedIn periodically hides full profiles or experience sections from public view. This affects random profiles and can last days to months.
- Delayed updates by individuals — People often change jobs weeks or months before updating their profiles. Someone who starts a new role in May might not update LinkedIn until September, and we’d capture that change in October or November.
Company data & verification
How is 'claimed page' status calculated?
Why do logo URLs return 403 errors?
API
How do I get an API key?
Why do I get 401 Unauthorized?
Why do I get 429 Too Many Requests?
What does 422 Validation Error mean?
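When a request is rejected with 422, the response body can be inspected in client code to see which fields failed. A minimal sketch, assuming each entry in the detail array is an object with `loc` and `msg` keys; those key names are an assumption, not confirmed by this page, and the code falls back to placeholders when they are absent.

```python
import json


def summarize_validation_errors(body_text):
    """Turn a 422 response body into one readable line per invalid field."""
    payload = json.loads(body_text)
    lines = []
    for item in payload.get("detail", []):
        # "loc" and "msg" are assumed key names; fall back if they are absent.
        location = ".".join(str(part) for part in item.get("loc", []))
        message = item.get("msg", "(no message)")
        lines.append(f"{location or '(unknown field)'}: {message}")
    return lines


# Hypothetical response body used only to exercise the function.
sample_body = json.dumps(
    {"detail": [{"loc": ["body", "email"], "msg": "field required"}]}
)
messages = summarize_validation_errors(sample_body)
print(messages)
```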
The detail array lists each invalid field and its message. Fix those fields and retry.
Troubleshooting
Data feeds
I can't connect to the data feed.
A table or column is missing.
public) and check the Tables & Schemas and table pages (Person, Organization, Insight).
InvalidAccessPointAlias error (S3)
- Use the AWS CLI — the Console web UI does not support --request-payer and will return access errors
- Confirm you’re targeting the correct region in your CLI command
- Verify your AWS account has been linked to the RevenueBase bucket
Access Denied or bucket not found (S3)
- Double-check that you’re including --request-payer requester in your CLI command
- Verify your AWS credentials are configured correctly (aws configure)
- Ensure the bucket alias or ARN is copied exactly as provided — typos in the access point name are common
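The checklist above can be folded into a small helper that always attaches the requester-pays flag and an explicit region when building the CLI call. This is a sketch under stated assumptions: the alias `EXAMPLE-ALIAS`, the region `us-east-1`, and the helper name `build_s3_copy_command` are placeholders, so substitute the values provided with your delivery. The code only builds the argument list and does not run the AWS CLI.

```python
import shlex


def build_s3_copy_command(source_uri, destination, region):
    """Build an `aws s3 cp` argument list that includes the requester-pays flag."""
    return [
        "aws", "s3", "cp", source_uri, destination,
        "--recursive",                     # copy everything under the prefix
        "--request-payer", "requester",    # required flag per the checklist above
        "--region", region,                # make the target region explicit
    ]


# Placeholder alias and region; replace with the values you were given.
argv = build_s3_copy_command("s3://EXAMPLE-ALIAS/", "./data/", "us-east-1")
print(shlex.join(argv))
```

Once credentials are set up with aws configure, the list can be passed to `subprocess.run(argv, check=True)`. Keeping the alias in a single variable also makes it easier to paste it exactly as provided.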
Snowflake share not visible
Missing RBID_ORG or RBID_PAO fields
Gigasheet
Can multiple users log in with the same email?
Who do I contact for Gigasheet UI issues?
Who do I contact for data or use-case questions?
