How Can You Extract Expedia Using Python And LXML? PowerPoint PPT Presentation

presentation player overlay
About This Presentation
Transcript and Presenter's Notes

Title: How Can You Extract Expedia Using Python And LXML?


1
(No Transcript)
2
Introduction
  • Collecting travel data related to flights is a
    huge task if its done manually.
  • Web extracting is the only solution to keep a
    track of data.
  • Our web extractor will scrape the flight prices
    and schedules for a source and destination.

3
Below is the listing of data fields that for
Expedia Scraper
  • Airport Arrival Destination
  • Arrival Airport Time
  • Departure Airport Destination
  • Departure Airline Time
  • Name of Plane
  • Airline
  • Duration of Flight
  • Code of Plane
  • Price of Ticket
  • Number of Stops

4
Scraping Logic
  • Build the URL to search results from Expedia
  • Download HTML for search result page utilizing
    Python request.

5
Scraping Logic
  • Parse the page utilizing LXML LXML lets you
    route the HTML Tree Structure utilizing Xpaths.
  • Save the information to JSON format.

6
Installing Pip and Python 3
  • Run the Expedia Scraper

7
Conclusion
  • This scraper must work for scraping most flight
    information feasible on Expedia unless the
    website structure changes radically.
  • If you are looking for the best scrape flight
    details from Expedia.com, then you can contact
    Scraping Intelligence for all your queries.

8
Thank You
  • Contact Scraping Intelligence for any web
    scraping services today!

Contact Us 1 281 899 0267 Email
info_at_websitescraper.com
Write a Comment
User Comments (0)
About PowerShow.com