Issues with form submission or login on Nike.com using a headless browser in Python

Dave_17Sketch · December 20, 2024, 9:49pm

I’m attempting to perform quality assurance and submit forms utilizing a headless browser in Python, but it seems my libraries aren’t capable of executing the form submission successfully. Can anyone help me identify the issue?

import requests
from requests_html import HTMLSession

session = HTMLSession()

response = session.get('http://www.nike.com/us/en_us/')

print(response.html)

login_data = {
    'username': '[email protected]',
    'password': 'test-password'
}

login_response = session.post('http://www.nike.com/us/en_us/login', data=login_data)

print(login_response.status_code)

Additionally, when I tried using RoboBrowser, I encountered the following error:

Traceback (most recent call last):
  File "path_to_your_code/nike_bot.py", line 44, in <module>
    browser.submit_form(login_form)
  File "path_to_robobrowser/robobrowser/browser.py", line 341, in submit_form
    response = self.session.request(method, url, **payload.to_requests(method))
...
requests.exceptions.InvalidSchema: No connection adapters were found for 'javascript:void(0);'

Alex_Thunder · December 30, 2024, 8:34am

Hey Dave,

Using a headless browser to interact with sites like Nike can be tricky due to JavaScript-heavy pages. Here's a quick solution:

Try using Selenium with a headless browser option. Ensure you'll need a compatible driver like chromedriver or geckodriver based on your choice of browser. Example:

from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.chrome.options import Options

options = Options()
options.headless = True  # Enable headless mode

driver = webdriver.Chrome(options=options)
driver.get('http://www.nike.com/us/en_us/')

# Locate fields and perform login actions
username = driver.find_element(By.NAME, 'username')
password = driver.find_element(By.NAME, 'password')
login_button = driver.find_element(By.XPATH, '//button[@type="submit"]')

username.send_keys('[email protected]')
password.send_keys('test-password')
login_button.click()

# Check login status or page content
print(driver.title)
driver.quit()

As for the RoboBrowser error, it may not support JavaScript, thus encountering issues. Using Selenium should resolve these problems.

FlyingStar · December 29, 2024, 1:05pm

Hi Dave,

Handling JavaScript-heavy sites like Nike's requires more than just requests or RoboBrowser. I'd recommend employing Selenium for better results, especially when dealing with dynamic content and form submission.

Here's a streamlined approach using Selenium:

from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.chrome.options import Options

options = Options()
options.headless = True  # Runs browser in headless mode

driver = webdriver.Chrome(options=options)
driver.get('https://www.nike.com/')

Follow these steps to simulate a login:

Find Elements: Use By.NAME or By.XPATH to locate form fields.
Submit Form: Interact with fields and buttons directly.

# Find elements and perform actions
driver.find_element(By.NAME, 'username').send_keys('[email protected]')
driver.find_element(By.NAME, 'password').send_keys('test-password')
driver.find_element(By.XPATH, '//button[@type="submit"]').click()

# Validate the action
title = "Nike Page Title"  # Substitute with the actual title to verify login success
if driver.title == title:
    print("Login successful")
else:
    print("Login failed")

driver.quit()

This solution ensures robust handling of dynamic web elements. For your RoboBrowser issue, since it doesn't process JavaScript, switching to Selenium should effectively mitigate the problem.

CreativePainter33 · December 31, 2024, 2:28am

Hi Dave,

Handling a JavaScript-intensive site such as Nike's requires using more sophisticated techniques than what requests or RoboBrowser can offer. As suggested, Selenium is an excellent alternative, but let me propose a slightly different approach that might optimize your workflow.

Consider using Pyppeteer, a Python port of Puppeteer, which is designed for high-level browser automation. This tool addresses headless browsing effectively and allows precise control over web interactions. Here’s how you could execute a form submission using Pyppeteer:

import asyncio
from pyppeteer import launch

async def main():
    # Launch Pyppeteer
    browser = await launch(headless=True)
    page = await browser.newPage()

    # Navigate to the page
    await page.goto('https://www.nike.com/')

    # Type into login form fields
    await page.type('input[name="username"]', '[email protected]')
    await page.type('input[name="password"]', 'test-password')

    # Click the login button
    await page.click('button[type="submit"]')

    # Wait for a selector to ensure the login was successful
    await page.waitForSelector('selector-that-only-appears-after-login')

    # Output page title to verify successful navigation
    print(await page.title())

    await browser.close()

# Run the event loop
asyncio.get_event_loop().run_until_complete(main())

Using Pyppeteer brings multiple advantages:

It provides fine-grained control over web pages, useful for intricate interactions.
It can handle dynamic content better due to its intrinsic JavaScript capabilities.
It supports headless mode, making it suitable for QA and automation tasks.

Regarding your RoboBrowser issues, they likely stem from its lack of JavaScript processing capabilities. Switching to a more suitable tool, like Pyppeteer or Selenium, should resolve these challenges.

AdventurousHiker17 · January 1, 2025, 5:14am

Hey Dave,

For seamless interactions with JavaScript-heavy sites like Nike's using a headless browser in Python, Selenium or Pyppeteer are ideal choices. Here's a quick setup using Selenium:

from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.chrome.options import Options

options = Options()
options.headless = True  # Runs browser in headless mode

driver = webdriver.Chrome(options=options)
driver.get('https://www.nike.com/')

username = driver.find_element(By.NAME, 'username')
password = driver.find_element(By.NAME, 'password')
login_button = driver.find_element(By.XPATH, '//button[@type="submit"]')

username.send_keys('[email protected]')
password.send_keys('test-password')
login_button.click()

# Confirm login success
if 'desired_page_title' in driver.title:
    print('Login successful')
else:
    print('Login failed')

driver.quit()

Selenium supports dynamic content manipulation, making it more reliable for such use cases. The RoboBrowser issue is likely due to its inability to process JavaScript, hence moving to Selenium will solve that.