HTTP HTTP: HyperText Transfer Protocol Basis for fetching Web pages - PowerPoint PPT Presentation

HTTP: HyperText Transfer Protocol • Basis for fetching Web pages request Network CSE 461 University of Washington 2

Sir Tim Berners-Lee (1955–) • Inventor of the Web • Dominant Internet app since mid 90s • He now directs the W3C • Developed Web at CERN in ‘89 • Browser, server and first HTTP • Popularized via Mosaic (‘93), Netscape • First WWW conference in ’94 … Source: By Paul Clarke, CC-BY-2.0, via Wikimedia Commons CSE 461 University of Washington 3

Web Context Page as a set of related HTTP transactions HTTP request HTTP response Hyperlink CSE 461 University of Washington 4

Web Protocol Context • HTTP is a request/response protocol • Runs on TCP, typically port 80 • Part of browser/server app browser server request HTTP HTTP response TCP TCP IP IP 802.11 802.11

Fetching a Web page with HTTP • Start with the page URL (Uniform Resource Locator): http://en.wikipedia.org/wiki/Vegemite Protocol Server Page on server • Steps: 1. Resolve the server to IP address (DNS) 2. Set up TCP connection to the server 3. Send HTTP request for the page 4. Await HTTP response for the page 5. Execute and fetch embedded resources, render 6. Clean up any idle TCP connections CSE 461 University of Washington 6

HTML • Hypertext Markup Language (HTML) • Uses Extensible Markup Language (XML) to build a markup language for web content • Key innovation was the “hyperlink”, an element linking to other HTML elements using URLs • Also includes Cascading Style Sheets (CSS) for maintaining look-and-feel across a domain • “Browser wars” over specific standards

DOM (Document Object Model) • Base primitive for HTML browsers • Use HTML to create a tree of elements • Embedded Javascript modifies the DOM based on: • User actions • Asynchronous Javascript • Other server-side actions CSE 461 University of Washington 8

Lets explore a page • https://www.cs.washington.edu/

Static vs Dynamic Web pages • Static: Just static files, e.g., image • Dynamic: Page content based on some computation • Javascript on client, PHP on server, or both CSE 461 University of Washington 10

HTTP Protocol • Originally simple; many options added over time • Text-based commands, headers • Try it yourself: As a “browser” fetching a URL • Run “telnet <server name> 80” • Enter “GET /index.html HTTP/1.0” • Server will return HTTP response CSE 461 University of Washington 11

HTTP Protocol (2) • Commands used in the request Method Description Fetch GET Read a Web page page HEAD Read a Web page's header Upload POST Append to a Web page data PUT Store a Web page Basically defunct DELETE Remove the Web page TRACE Echo the incoming request CONNECT Connect through a proxy OPTIONS Query options for a page

HTTP Protocol (3) • Codes returned with the response Code Meaning Examples 1xx Information 100 = server agrees to handle client's request Yes! 200 = request succeeded; 204 = no content 2xx Success present 3xx Redirection 301 = page moved; 304 = cached page still valid 4xx Client error 403 = forbidden page; 404 = page not found 5xx Server error 500 = internal server error; 503 = try again later CSE 461 University of Washington 13

Representational State Transfer (REST) T) • Using HTTP for general network services • RESTful APIs: An ideal for design of HTTP-based APIs • Core tenets: • Stateless (no state on server) • Cacheable (individual URLs can be cached) • Layered (no visibility under REST hood)

Performance

PLT (Page Load Time) • PLT is a key measure of web performance • From click until user sees page • Small increases in PLT decrease sales • PLT depends on many factors • Structure of page/content • HTTP (and TCP!) protocol • Network RTT and bandwidth CSE 461 University of Washington 17

Early Performance • HTTP/1.0 used one TCP connection per web resource • Made HTTP very easy to build • But gave fairly poor PLT… CSE 461 University of Washington 18

Reasons for Poor PLT • Sequential request/responses, even when to different servers • Multiple TCP connection setups to the same server • Multiple TCP slow-start phases • Network is not used effectively • Worse with many small resources CSE 461 University of Washington 19

Ways to Improve PLT 1. Reduce content size for transfer • Smaller images, gzip 2. Make better use of the network • Next 3. Avoid fetching same content • Caching and proxies [later] 4. Move content closer to client • CDNs [later later] CSE 461 University of Washington 20

Better Network Use: Parallel Connections • Browser runs multiple (say, 8) parallel HTTP instances • Server is unchanged; already handled concurrent requests for many clients • How does this help? • Single HTTP wasn’t using network much … • So parallel connections aren’t slowed much • Pulls in completion time of last fetch CSE 461 University of Washington 21

Better Network Use: Persistent Connections • Parallel connections compete with each other for network resources • 1 parallel client ≈ 8 sequential clients? • Exacerbates network bursts, and loss • Persistent connections • Make 1 TCP connection to 1 server • Use it for multiple HTTP requests CSE 461 University of Washington 22

Persistent Connections Persistent Persistent connections + connections pipelining One request per connection CSE 461 University of Washington 23

Persistent Connections (2) • Widely used as part of HTTP/1.1 • Supports optional pipelining • PLT benefits depending on page structure, but easy on network But we didn’t stop there …. CSE 461 University of Washington 24

Web Caching and Proxies

Web Caching • Users often revisit web pages • Big win from reusing local copy, aka, caching Local copies Network Cache Server • Key question: • When is it OK to reuse local copy? CSE 461 University of Washington 26

Locally Determine Validity of Cached Content • Based on expiry information such as “Expires” header • Or a heuristic (cacheable, fresh, not modified recently) • Content is then available right away Network Cache Server CSE 461 University of Washington 27

Use Server to Validate Cached Content • Based on “Last-Modified” header from server • Or based on “Etag” header from server • Content is available after 1 RTT (if connection open) Network Cache Server CSE 461 University of Washington 28

Web Caching: Putting it together CSE 461 University of Washington 29

Web Proxies • Place intermediary between clients and servers • Benefits for clients include a shared cache • Limited by secure / dynamic content • Also limited by “long tail” • Organizational access policies too! CSE 461 University of Washington 30

Web Proxies in Action Clients contact proxy; proxy contacts server • Cache Near client Far from client CSE 461 University of Washington 31

Content Delivery Networks • As the Web took off, traffic volumes grew and grew. 1. Concentrated load on popular servers 2. Led to congested networks 3. Gave a poor user experience • Idea: • Place popular content near clients • Helps with all three issues above CSE 461 University of Washington 33

Before CDNs • Sending content from the source server to 4 users takes 4 x 3 = 12 “network hops” in the example User . . . Source User CSE 461 University of Washington 34

After CDNs • Sending content via replicas takes only 4 + 2 = 6 “network hops” User . . . Replica Source User CSE 461 University of Washington 35

After CDNs (2) • Benefits assuming popular content: • Reduces source server, network load • Improves user experience User . . . Replica Source User CSE 461 University of Washington 36

Popularity of Content • Zipf’s Law: few popular items, many George Zipf (1902-1950) unpopular ones; both matter Zipf popularity (kth item is 1/k) Rank Source: Wikipedia CSE 461 University of Washington 37

How to place content near clients? • Idea 1: Use browser and proxy caches • Helps, but limited to one client or clients in one organization • Want to place replicas across the Internet for use by all nearby clients • Idea 2: Map clients to a nearby replica • Done via clever use of DNS CSE 461 University of Washington 38

Content Delivery Network CSE 461 University of Washington 39

Content Delivery Network (2) • DNS gives different answers to clients • Tell each client the nearest replica (map client IP) CSE 461 University of Washington 40

Business Model • Clever model pioneered by Akamai • Placing site replica at an ISP is win-win • Improves site experience and reduces ISP bandwidth usage User Transit ISP Source ISP . . . Replica User CSE 461 University of Washington 41

CDNs Issues • Performance: How accurate can the IP map be? • Dynamic pages: What about dynamic content? • Security: How to cache/forward encrypted content? • Privacy: What about private information?

HTTP HTTP: HyperText Transfer Protocol Basis for fetching Web pages - PowerPoint PPT Presentation

HTTP HTTP: HyperText Transfer Protocol Basis for fetching Web pages request Network CSE 461 University of Washington 2 Sir Tim Berners-Lee (1955) Inventor of the Web Dominant Internet app since mid 90s He now directs the W3C

hypertext, multimedia and the world-wide web hypertext, multimedia and the world-wide web

Web Security, Summer Term 2012 HyperText Transfer Protocol - HTTP Dr. E. Benoist Sommer Semester

Web Security, Summer Term 2012 Secure HyperText Transfer Protocol Dr. E. Benoist Sommer Semester

2) Secure HyperText Transfer Protocol Emmanuel Benoist Fall Term 2020/2021 Berner Fachhochschule

HTTP Arthur Zaczek Aug 2015 HTTP 1 Einleitung 1.1 Definition Das Hypertext Transfer Protocol

hypertext, multimedia finding things finding things navigating hyperspace and the

WWW HTTP, Ajax, APIs, REST HTTP Hypertext Transfer Protocol Request Web Client HTTP Server

HyperText Transfer Protocol Nicolas Rogemond T-110.456 : Next generation cellular networks

A Comparative Review of HTTP/1.1, HTTP/2 & HTTP/3 December 3, 2018 Nancy Mogire WHAT

SEMANTIC CLUSTERING AND SPATIAL HYPERTEXT CMSC-676 - Pranav Ganore INTRODUCTION: Spatial

Hypertext Markup Language Introduction to Web Design Hypertext Markup Language Introduction to

Web Development PHP CSCI-GA 1122 Hypertext Preprocessor Web Development PHP CSCI-GA 1122

Hypertext Markup Language Drawing on the Web Hypertext Markup Language Drawing on the Web A

Communication and DASH Presented by: Gabriel Loewen and Vivens Ndatinya Communication

FTP File Transfer Protocol Computer Center, CS, NCTU FTP FTP File Transfer Protocol

1 2 Security Authentication Principles 3 4 Hypertext Transfer Cryptography Protocol Secure

Distributed Systems Principles and Paradigms Maarten van Steen VU Amsterdam, Dept. Computer

Web Caching based on: Web Caching , Geoff Huston Web Caching and Zipf-like Distributions:

Web Caching and Content Delivery Web Caching and Content Delivery Caching for a Better Web

A Socially Aware Caching Mechanism for Encounter Networks Future Internet Architectures: New

1945: Vannevar Bush The Internet End-End As we may think, Atlantic The Web Monthly,

and Transitive Trust Jeff Jarmoc Sr. Security Researcher Dell SecureWorks About this talk

Web Engineering HTTP-message = Request | Response generic-message = start-line *message-header

Current Status of CVMFS Fast, scalable, and reliable software distribution Jakob Blomer July

HTTP HTTP: HyperText Transfer Protocol Basis for fetching Web pages - PowerPoint PPT Presentation

HTTP HTTP: HyperText Transfer Protocol Basis for fetching Web pages request Network CSE 461 University of Washington 2 Sir Tim Berners-Lee (1955) Inventor of the Web Dominant Internet app since mid 90s He now directs the W3C

hypertext, multimedia and the world-wide web hypertext, multimedia and the world-wide web

Web Security, Summer Term 2012 HyperText Transfer Protocol - HTTP Dr. E. Benoist Sommer Semester

Web Security, Summer Term 2012 Secure HyperText Transfer Protocol Dr. E. Benoist Sommer Semester

2) Secure HyperText Transfer Protocol Emmanuel Benoist Fall Term 2020/2021 Berner Fachhochschule

HTTP Arthur Zaczek Aug 2015 HTTP 1 Einleitung 1.1 Definition Das Hypertext Transfer Protocol

hypertext, multimedia finding things finding things navigating hyperspace and the

WWW HTTP, Ajax, APIs, REST HTTP Hypertext Transfer Protocol Request Web Client HTTP Server

HyperText Transfer Protocol Nicolas Rogemond T-110.456 : Next generation cellular networks

A Comparative Review of HTTP/1.1, HTTP/2 &amp; HTTP/3 December 3, 2018 Nancy Mogire WHAT

SEMANTIC CLUSTERING AND SPATIAL HYPERTEXT CMSC-676 - Pranav Ganore INTRODUCTION: Spatial

Hypertext Markup Language Introduction to Web Design Hypertext Markup Language Introduction to

Web Development PHP CSCI-GA 1122 Hypertext Preprocessor Web Development PHP CSCI-GA 1122

Hypertext Markup Language Drawing on the Web Hypertext Markup Language Drawing on the Web A

Communication and DASH Presented by: Gabriel Loewen and Vivens Ndatinya Communication

FTP File Transfer Protocol Computer Center, CS, NCTU FTP FTP File Transfer Protocol

1 2 Security Authentication Principles 3 4 Hypertext Transfer Cryptography Protocol Secure

Distributed Systems Principles and Paradigms Maarten van Steen VU Amsterdam, Dept. Computer

Web Caching based on: Web Caching , Geoff Huston Web Caching and Zipf-like Distributions:

Web Caching and Content Delivery Web Caching and Content Delivery Caching for a Better Web

A Socially Aware Caching Mechanism for Encounter Networks Future Internet Architectures: New

1945: Vannevar Bush The Internet End-End As we may think, Atlantic The Web Monthly,

and Transitive Trust Jeff Jarmoc Sr. Security Researcher Dell SecureWorks About this talk

Web Engineering HTTP-message = Request | Response generic-message = start-line *message-header

Current Status of CVMFS Fast, scalable, and reliable software distribution Jakob Blomer July

A Comparative Review of HTTP/1.1, HTTP/2 & HTTP/3 December 3, 2018 Nancy Mogire WHAT