At Lambert Labs we specialise in aggregating structured and unstructured data from a range of sources. We use industry-standard Python tools and custom built solutions to scrape text, images and videos from the internet.
Our skills include:
- Combining the Scrapy framework with Selenium to create a powerful scraping system
- Running headless Selenium browsers on remote servers
- Using Selenium with custom-built Chrome extensions to give full access to browser content
- Deep crawling
- Building generic web crawlers that can extract content from any website