Fetching all Hubspot contacts via API: Overcoming the 100-contact limit

Neo_Stars · May 7, 2025, 2:51pm

I’m working on a project where I need to grab all my Hubspot contacts and put them in a DataFrame. The problem is that the API only lets me fetch 100 contacts at a time. Here’s what I’ve got so far:

import hubspot
from pprint import pprint

client = hubspot.Client.create(api_key='my_secret_key')

try:
    result = client.crm.contacts.basic_api.get_page(limit=100, archived=False)
    pprint(result)
except Exception as oops:
    print(f'Oops, something went wrong: {oops}')

This works fine for the first 100 contacts, but I’m stuck on how to keep going. Is there a way to keep calling the API until I’ve got all the contacts? Maybe some kind of loop that stops when there are no more contacts left to fetch? I’m pretty new to working with APIs, so any help would be awesome!

Zack_45Gaming · May 18, 2025, 2:27am

yo, i’ve dealt with this before. u gotta use pagination. basically, u keep making requests with an offset until u get all contacts. sumthin like:

all_contacts = []
offset = 0
while True:
    result = client.crm.contacts.basic_api.get_page(limit=100, offset=offset)
    all_contacts.extend(result.results)
    if not result.paging:
        break
    offset += 100

this should grab everything for ya

Nova56 · May 17, 2025, 9:02pm

I’ve encountered a similar challenge when working with the HubSpot API. One effective approach is to implement a cursor-based pagination strategy. This method is more robust than offset-based pagination, especially for large datasets.

Here’s an example of how you could modify your code:

all_contacts = []
after = None
while True:
    try:
        result = client.crm.contacts.basic_api.get_page(limit=100, after=after)
        all_contacts.extend(result.results)
        if not result.paging or not result.paging.next.after:
            break
        after = result.paging.next.after
    except Exception as e:
        print(f'Error fetching contacts: {e}')
        break

print(f'Total contacts fetched: {len(all_contacts)}')

This approach uses the ‘after’ parameter to fetch subsequent pages, which is generally more efficient and less prone to issues with data changes during pagination. Remember to implement proper error handling and consider rate limiting to avoid hitting API quotas.

elizabeths · May 17, 2025, 10:49am

I’ve had success fetching all Hubspot contacts using a combination of pagination and the ‘has-more’ flag. Here’s a method that worked for me:

all_contacts = []
has_more = True
offset = 0

while has_more:
    result = client.crm.contacts.basic_api.get_page(limit=100, offset=offset)
    all_contacts.extend(result.results)
    has_more = result.paging.next.link is not None
    offset += len(result.results)

print(f'Total contacts retrieved: {len(all_contacts)}')

This approach keeps fetching contacts until there are no more to retrieve. It’s efficient and handles the API’s pagination seamlessly. Just be mindful of API rate limits and consider adding a small delay between requests if needed.

Liam_25Meditation · May 16, 2025, 10:14am

hey there, i had the same issue. try this:

contacts = []
after = None
while True:
    page = client.crm.contacts.basic_api.get_page(limit=100, after=after)
    contacts.extend(page.results)
    if not page.paging or not page.paging.next:
        break
    after = page.paging.next.after

this’ll keep goin till u got all ur contacts. just watch out for api limits!