Skip to content

issues Search Results · repo:privacy-tech-lab/gpc-web-crawler language:JavaScript

Filter by

75 results
 (77 ms)

75 results

inprivacy-tech-lab/gpc-web-crawler (press backspace or delete to remove)

As mentioned in the previous meeting, when I did the first batch of the crawl, all the data are collected except for the Firefox s urlClassification. When I open the site manually without a crawl, I was ...
bug
discussion
invalid
  • franciscawijaya
  • 4
  • Opened 
    7 days ago
  • #122

As we have identified from the April and June crawl, there has always been sites with empty entries (No Data). . For the data for these two months, there are around 900+ empty entries. I have been brainstorming ...
data analysis
documentation
  • franciscawijaya
  • 7
  • Opened 
    17 days ago
  • #121

As GPC is required in Colorado since July 1, 2024 and will be required in Connecticut from January 1, 2025, it would make sense to also do crawls there. Mullvad has Denver, Colorado as VM location, and ...
long-term
  • SebastianZimmeck
  • Opened 
    18 days ago
  • #120

June Crawl collects this for yelp which seems like it does not successfully collect any data for Yelp.com: id : 1, site_id : 0, domain : yelp.com , sent_gpc : 1, uspapi_before_gpc : null, uspapi_after_gpc ...
crawl
data analysis
discussion
invalid
  • franciscawijaya
  • 21
  • Opened 
    on Jun 23
  • #119

I will be performing July Crawl. Taking into account the duration of June crawl and other upcoming tasks in the month of July, I am planning to start the Crawl around 8th of July so that we would have ...
crawl
  • franciscawijaya
  • 1
  • Opened 
    on Jun 23
  • #118

@franciscawijaya , currently, the readme has six sections, but the content links at the top of the readme only have four, e.g., Thank you is now section 6. Also, the Components: hangs a bit in the air ...
documentation
  • SebastianZimmeck
  • 4
  • Opened 
    on Jun 14
  • #114

Reminder for me.
documentation
  • SebastianZimmeck
  • 7
  • Opened 
    on Jun 5
  • #111

GPP 1.0 is no longer supported. If a site is broadcasting a GPP 1.0 signal, other entities on the page (eg Prebid.js or Google Ad Manager) generally will not understand it. You should just fail any site ...
core functionality
crawl
  • patmmccann
  • 23
  • Opened 
    on May 23
  • #110

@franciscawijaya will perform the crawl (with possible help from @katehausladen).
crawl
  • SebastianZimmeck
  • 12
  • Opened 
    on Apr 23
  • #108
Issue origami icon

Learn how you can use GitHub Issues to plan and track your work.

Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub Issues
ProTip! 
Press the
/
key to activate the search input again and adjust your query.
Issue origami icon

Learn how you can use GitHub Issues to plan and track your work.

Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub Issues
ProTip! 
Restrict your search to the title by using the in:title qualifier.