FAQ - RevenueBase

Data organization & structure

How is the data organized?

RevenueBase uses three core identifiers to track people, organizations, and the relationship between them:

RBID_PER — Unique person identifier. This follows the person across job changes. When someone moves to a new company, their RBID_PER stays the same.
RBID_ORG — Unique organization identifier.
RBID_PAO — Unique person-at-organization identifier. This is essentially RBID_PER combined with RBID_ORG, so it changes when a person moves to a new company.

If someone holds multiple current positions, they will have multiple rows in the contacts table — each with a unique RBID_PAO, but sharing the same RBID_PER.

Why are RBID_ORG and RBID_PAO sometimes missing?

These fields may be null when a person’s LinkedIn profile doesn’t link to an existing company page. The company name field on LinkedIn is free text — people can type anything and may skip linking to a recognized company page. Without that link, we can’t resolve the organization to a canonical RBID_ORG.

What are sparse or 'junk' profiles?

Some profiles in the dataset contain minimal information — a name and loose company association, but limited detail on title, seniority, or work history. These come from source profiles that simply contain less data. In some cases they may be incomplete or low-quality.The strongest indicator that a person genuinely belongs to an organization is a valid company email address. If a sparse profile has a verified email at the company domain, treat it as reliable. If it has no verified email, treat the association with lower confidence. See Data Freshness & Quality for more on assessing record quality.

Data delivery & access

What data delivery methods are available?

RevenueBase supports three delivery methods:

Snowflake Data Sharing — We share tables directly to your Snowflake account. No data movement required; query in place.
AWS S3 bucket — We deliver files to an S3 bucket (either one we provision for you, or your own). Access via AWS CLI.
Gigasheet — Browser-based spreadsheet UI for searching, filtering, and exporting data without code.

Azure Blob Storage and Google Cloud Storage delivery are available on request. Contact your RevenueBase account representative to set up these delivery methods.For setup instructions, see the Quickstart.

How do I access my AWS Account ID?

How do I access my Snowflake Account ID?

What file formats are supported for S3 delivery?

RevenueBase supports sending dynamic feeds in three formats: CSV, JSON, and Parquet. When you get access to the data in S3, you get access to all three formats.

CSV — Best for spreadsheet tools, SQL imports, and lightweight pipelines
JSON — Best for programmatic ingestion and nested data structures
Parquet — Best for high-performance analytics and columnar query engines. Read more in Parquet’s official documentation.

How do I access the Snowflake share?

How do I get data feed credentials?

Feed credentials are separate from your API key. In the dashboard, open Settings → Data Feeds to see connection details and credentials.

Will credentials and paths stay the same across deliveries?

Yes. For S3 deliveries, your bucket name and access credentials are permanent — they don’t change between monthly releases.Each delivery creates a new date-stamped folder following this path structure:

<delivery_date>/<dataset_name>/<file_format>/

delivery_date — Date of delivery in YYYYMMDD format
dataset_name — Depends on your subscription (e.g., per, org)
file_format — One of json, csv, or parquet

Previous deliveries remain accessible at their original paths and are never overwritten.

Data updates & refresh

When is the data refreshed?

New data releases target the 1st of each month. Depending on weekends, the actual date may shift by ±2–3 days. We send an email notification when new data is posted and ready.

What is the verification cadence?

Two verification cycles run in parallel:

Profile verification — 95% of profiles are re-verified every 90 days. “Re-verified” means the source profile was accessible and we were able to confirm or update at least the core profile information (name, title, company, location). Check the updated_at field for the most recent verification date on any record.
Email verification — All email addresses are re-verified every 60 days. Check the email_last_verified_at field. A valid email is a strong signal that the associated work experience is still current.

See Data Freshness & Quality for the full explanation of how verification works.

Why are some profiles outdated or not recently updated?

Profiles may have older updated_at dates for several reasons:

Source accessibility issues — The profile returned a 404 or was temporarily unreachable during the verification pass. Profiles that remain inaccessible for one year are deprecated and removed from the dataset.
Platform scraping protections — LinkedIn periodically hides full profiles or experience sections from public view. This affects random profiles and can last days to months.
Delayed updates by individuals — People often change jobs weeks or months before updating their profiles. Someone who starts a new role in May might not update LinkedIn until September, and we’d capture that change in October or November.

See Data Freshness & Quality for detailed guidance on working with these records.

Company data & verification

How is 'claimed page' status calculated?

We detect whether a LinkedIn company page appears to be actively managed by checking for custom content such as a logo, website address, company size, and description.This method is not 100% accurate. Some pages may have extensive information (phone numbers, addresses, founding dates) but still appear unclaimed. The challenge is that checking pages without being logged in doesn’t show LinkedIn’s unclaimed banner, so our system relies on indirect signals. We’re actively improving the accuracy of this detection.

Why do logo URLs return 403 errors?

LinkedIn periodically rotates logo URLs to protect their assets. Updated URLs are included in each monthly data refresh, though they may eventually face the same issue.The most reliable solution is to build a logo cache on your end and populate it incrementally with each delivery. This way you maintain persistent access regardless of URL rotation.

API

How do I get an API key?

Sign in to the RevenueBase dashboard, go to Settings → API Keys, and click Create API key. Copy and store the key securely. You can view and copy your API key anytime from Settings → API Keys in the dashboard.

Why do I get 401 Unauthorized?

Check that you are sending the key in the x-key header (not Authorization). Ensure the key is correct and has not been revoked.

Why do I get 429 Too Many Requests?

You have exceeded the rate limit. Wait for the window to reset or implement exponential backoff. See Rate limits and error codes.

What does 422 Validation Error mean?

The request body or parameters failed validation. The response detail array lists each invalid field and message. Fix those fields and retry.

Troubleshooting

Data feeds

I can't connect to the data feed.

Verify host, port, and TLS settings. Ensure you are using the feed username and password, not the API key. Check firewall and network rules.

A table or column is missing.

Table and schema access depend on your plan. Confirm the table name and schema (e.g., public) and check the Tables & Schemas and table pages (Person, Organization, Insight).

InvalidAccessPointAlias error (S3)

This error typically occurs when using the AWS Console instead of the AWS CLI, or when the region is misconfigured.

Use the AWS CLI — the Console web UI does not support --request-payer and will return access errors
Confirm you’re targeting the correct region in your CLI command
Verify your AWS account has been linked to the RevenueBase bucket

If the issue persists, contact support@revenuebase.ai.

Access Denied or bucket not found (S3)

Double-check that you’re including --request-payer requester in your CLI command
Verify your AWS credentials are configured correctly (aws configure)
Ensure the bucket alias or ARN is copied exactly as provided — typos in the access point name are common

Snowflake share not visible

Missing RBID_ORG or RBID_PAO fields

These fields may be null when a person’s LinkedIn profile doesn’t link to a company page. The company name on LinkedIn is free text, so some profiles will not have a resolvable organization ID. A valid company email is the strongest indicator that the person-organization association is accurate.

Gigasheet

Can multiple users log in with the same email?

No. This breaks the system. Each user must have their own individual email account on a business domain. If you have multiple business domains, you must use the email your account was originally created with — otherwise you won’t have the correct account permissions.

Who do I contact for Gigasheet UI issues?

Contact support@gigasheet.com for slow load times, exports not working, or adding/removing users.

Who do I contact for data or use-case questions?

Contact your RevenueBase Account Manager or support@revenuebase.ai for questions about your data or how to use the platform.

Still have questions?

Data Freshness & Quality

How verification works, why records become stale, and how to assess quality programmatically.

API Reference & Playground

Full API documentation, endpoint reference, and interactive playground.

Get Support

Email support@revenuebase.ai for help from the RevenueBase team.

​Data organization & structure

​Data delivery & access

​Data updates & refresh

​Company data & verification

​API

​Troubleshooting

​Data feeds

​Gigasheet

​Still have questions?