AbdElRhmanArafa
Contributor

No description provided.

@AbdElRhmanArafa
Contributor Author

image

Contributor

@mazen-r mazen-r left a comment

Please resolve the conflict first. Also, the scraper doesn't pass: the product data comes back empty:

    "rate": null,
    "reviews": null,
    "soldCount": 0,
    "availableCount": null
  },
  "pricing": {
    "priceCurrency": "USD $",
    "price": null,
    "originalPrice": "No discount",
    "discount": "No discount"
  },
  "specifications": [],
  "shipping": {
    "cost": null,
    "currency": "$",
    "delivery": null
  },
  "faqs": [],
  "seller": {
    "name": null,
    "link": null,
    "id": null,
    "info": {
      "positiveFeedback": null,
      "followers": null
    }
  },
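
An empty result like the one above can be caught before it is saved. The following is a minimal sketch, assuming the parsed product is a plain dict shaped like the output above; `looks_blocked` is a hypothetical helper name, not part of the scraper in this PR, and the choice of `price`, `seller.name`, and `specifications` as indicator fields is an assumption.

```python
from typing import Any, Dict


def looks_blocked(product: Dict[str, Any]) -> bool:
    """Heuristic: treat the result as empty when the key fields are all missing.

    Default values such as "USD $" or "No discount" are ignored on purpose,
    because the parser emits them even when nothing was scraped.
    """
    pricing = product.get("pricing") or {}
    seller = product.get("seller") or {}
    return (
        pricing.get("price") is None
        and seller.get("name") is None
        and not product.get("specifications")
    )


# Fragment mirroring the empty output quoted in this review.
empty_product = {
    "pricing": {"priceCurrency": "USD $", "price": None,
                "originalPrice": "No discount", "discount": "No discount"},
    "specifications": [],
    "seller": {"name": None, "link": None, "id": None},
}

if looks_blocked(empty_product):
    print("product data is empty; the request was probably blocked")
```

A check like this could run in the test suite so that a blocked response fails loudly instead of writing null fields to the results file.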

Also, why did you remove the cookie for the localization settings? It's good to have it to control the localization, since the domain is available in different countries. If it's outdated, we should update it rather than remove it.

@AbdElRhmanArafa
Contributor Author

I had checked it before opening the PR and uploaded it to aliexpress-scraper/results/product.json.
I just tested it again and it works. I’ll try to reproduce the issue and update the PR accordingly.

image

As for the aep_usuc_f cookie, I removed it because it is set automatically based on the proxy country, and keeping it leads to blocks even after updating its value.
image

@mazen-r
Contributor

mazen-r commented Oct 7, 2025

Are you sure you don't have cache enabled? Because I checked the request and it's actually blocked:
https://scrapfly.io/dashboard/monitoring/log/01K70ANPMZZY18PQ57RXVV4A98

Could you share your request log for that successful run?
For the aep_usuc_f cookie, do you still get blocked if you set the cookie with values matching the proxy country?
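
One way to test this is to derive the cookie from the proxy country so the two never disagree. The following is a rough sketch only: the field layout (`site`, `c_tp`, `region`, `b_locale`) is an assumption about the aep_usuc_f format and is not confirmed in this thread, and `LOCALE_BY_COUNTRY` and `build_localization_cookie` are hypothetical names.

```python
# Hypothetical mapping from proxy country to (currency, locale); extend as needed.
LOCALE_BY_COUNTRY = {
    "US": ("USD", "en_US"),
    "GB": ("GBP", "en_GB"),
    "DE": ("EUR", "de_DE"),
}


def build_localization_cookie(country: str) -> str:
    """Build a Cookie header value whose region matches the proxy country.

    Raises KeyError for countries missing from LOCALE_BY_COUNTRY, so a
    mismatched cookie is never sent silently.
    """
    region = country.upper()
    currency, locale = LOCALE_BY_COUNTRY[region]
    # Assumed field layout; adjust if the site expects different keys.
    fields = {"site": "glo", "c_tp": currency, "region": region, "b_locale": locale}
    value = "&".join(f"{key}={val}" for key, val in fields.items())
    return f"aep_usuc_f={value}"


print(build_localization_cookie("us"))
```

The same country code would then be passed both to the proxy setting and to this helper, which makes it easy to compare a run with the cookie against a run without it.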

the log for "retrieving a session ID" here is redundant:

async def scrape_product(url: str) -> List[Product]:
    """scrape aliexpress products by id"""
    log.info("retrieving a session ID")
    log.info("scraping product: {}", url)

and the PR still has conflicts

@AbdElRhmanArafa
Contributor Author

@mazen-r
I have pushed a new update.
image
