SlideShare a Scribd company logo
WMB-520:  Web Technology Web Server Setup Rutgers University Center for Applied Computing Technology Instructor:  Christopher Uriarte Meeting 1: Introduction to the Internet, World Wide Web and Web Servers
Course Overview and Goals This course will teach you how to install, configure, and administer a Web server that runs on a Unix system and can be used to deliver dynamic content.  The course objectives will be achieved through a combination of lectures, demonstrations, and hands-on exercises.
About Your Instructor… Chris Uriarte – chrisjur@cju.com.  Feel free to contact me anytime via email. Homepage for this class linked from: http://www.cju.com/classes Contains all slides, notes and misc. links and resources.
What This Course Is and Is Not The purpose of the course is to teach you the concepts behind serving content on the web and how to run and administer web server software on a UNIX system. This means you will be learning how to use tools to deliver content for the World Wide Web, not to create content.  Rutgers offers other courses designed to teach you how to create content for the World Wide Web (HTML design, Perl, etc.)
Prerequisites Familiarity with a Web browser such as Netscape or Internet Explorer.  You  MUST  have user-level experience with UNIX and must be familiar with the use of a UNIX text editor like vi, emacs and pico
Class Schedule Week 1 : Introduction to the Internet, the World Wide Web, and Web Servers  Week 2 : Installing and Configuring the Apache Web Server  Week 3:  Extending Apache: Advanced Topics  Week 4:  More advanced configuration and Web Security  Note:  classes 2-4 are hands-on classes.
Course Resources Textbook:  Apache:  The Definitive Guide  by Laurie, Laurie and Denn (O’Reilly Press, 1999).  User account on UNIX server  blender.rutgers.edu .  (Remote access available via SSH) Slides and resources on class hompage, linked from http://www.cju.com/classes/
How does the World Wide Web Work? Works on a client/server model: The Web server software is the server component, the Web browser is the client component. Purpose of the Web server is to provide documents to clients.  Web servers, Web browsers, and the information that is shared between them through the Hypertext Transfer Protocol (HTTP) protocol make up the World Wide Web.
History of the World Wide Web Grew out of the Internet, a network of networks that began in the early 1970’s and was used to support a variety of services (including telnet, ftp, Usenet, email, and gopher).  Most of these services communicated via via TCP/IP (Transmission Control Protocol/Internet Protocol).  In 1989, Tim Berners-Lee at CERN developed a new system to simplify document distribution and to allow documents to be linked together. Called the “World Wide Web.”
Web History, con’t. In 1993, the National Center for Supercomputing Applications (NSCA) released to the public the NCSA Web Server software and a GUI Web browser, called Mosaic. Quickly became popular.  Mosaic became Netscape – first major web browser distribution.
The Web Today The Web has evolved greatly since it was first implemented, but it’s overall architecture has remained generally the same. Still fueled by three major components:  A network (typically the Internet, but can also include local networks or Intranets), a client component (a web browser) and a server software component (what we’re going to learn about in this class).
Webmasters, Sys-Admins and Developers, Oh My! There are a number of key roles that are necessary to support a web infrastructure.  Providing end-to-end web services requires knowledge about systems, network, software, graphical design, programming and much more. There are many different
Roles in the Web World: Web Designers – Create graphical elements and determine layout of Website.  Content Providers- Create and edit HTML documents.  Web Developers – Write web applications using programming languages such as Java, JavaScript, ASP, PHP and Perl other used to deliver dynamic content.
Roles in the Web World (con’t): System Administrators – Responsible for maintaining the Web server software and often the operating system and hardware where the Web server is installed.  Network Administrators – Responsible for the design and maintanance of network components used to deliver web content. For most organizations, these responsibilities tend to be split over multiple job positions except for very small and simple Web sites.
Hosting A Website:  The Planning Phase There are a number of key questions an organization or individual must ask when planning to deploy a website:  How and where will you host it?  What kind of hardware will you use?  What kind of Operating System will the hardware run?  What Web server software will you use?  What domain name will your site use?  Answers to above questions usually determined by budget, staffing, and existing infrastructure of your organization.
Hosting Your Website:  Options Use a Free Page Site  – For personal use, limited space and tools, typically adds advertisements. (examples:  Yahoo, Tripod, Xoom, etc.).  Limitation on amount of traffic your site is allowed.  Generally not an option for business use. Personal Page Site  – For personal use, usually included with an ISP (about $20 per month), includes small amoung of disk space, no or limited access to server-side technologies for delivering dynamic content, generally uses your ISP’s Internet domain name. Limitation on amount of traffic your site is allowed. (Website URL usually looks something like:  http://www.yourisp.com/~yourusername ).  Might be an option for very small businesses. Under both models, ISP owns and manages the server, network and web server software.  You simply provide the content.
Hosting Your Website, con’t. Virtual Hosting  – Most popular web hosting option todays.  Suitable for business or personal use. Using this model, an ISP uses one machine to host many different websites (sometimes called “shared hosting”).  You can use your own Internet domain name ( http://www.yourdomain.com ).  These typically provide a wide range of tools for building more complex Websites.  ISP owns and manages the server, network and web server software.  You simply provide the content. Overall cost based on disk usage and website traffic, ranges from $20 to several thousands of dollars a month.  Now generally available through all ISPs and specialized Hosting-only providers such as Highway Technologies ( http://www.hway.net ) and YourDomainHost ( http:// www.yourdomainhost.com )
Hosting Your Website, con’t. Dedicated Server Services  – For business use, ISP owns and the hardware and network. Your organization typically has the option of managing and configuring the server.  You provide the content.  Your organization has exclusive access to the server. Price based on the type of server you require and the amount of traffic your site uses. Co-Location Services (“co-lo”)  – For business use - Your organization owns and manages the hardware, software and content.  ISP provides you with space to place the server and the network connectivity. Price based on the amount of space your servers require, the power requirements of the servers and the amount of traffic.
Hosting Your Website, con’t. Managing your Own Web Server Network (“in-house” web hosting)  – You provision, configure and manage the network connection, hardware, software and provide the content. Most flexible option – you have complete control. Cost can be very high or very low – depending on the business need.
Hosting Your Web Server: Do It Yourself Networking Options For an Intranet Server– Need a LAN (local area network).  Does not require an Internet connection. For an Internet Server – Need a dedicated Internet connection.  Internet Connectivity Options:  POTS (up to 56Kbps) – not practical for web use anymore.  ISDN (128Kbps) – not practical for web use either (costly, slow) Cable (512Kbps – 10Mbps)  DSL (128kps – 1.54 Mbps+)  T-1 (up to 1.54Mbps) – full, fractional, or burstable  T-3 (up to 45 Mbps) – full, fractional, or burstable
Finding an ISP Some ISPs specialize in web hosting and provide all the service described earlier (shared hosting, dedicated server, co-location, etc.). Other ISPs specialize only in commercial Internet access (AOL, Earthlink, etc.).  They may provide free personal website space. Check  The List  –  http://thelist.com , for a comprehensive list ISPs and their services.
Hosting Your Server: Hardware Options There are a number of things to consider when choosing the hardware platform for your website: Need to select a machine architecture (i.e Intel Compatible PC, Sun, Macintosh) – Typically dictated by your Operating System of choice Processor speed and number of processors.  RAM and Disk Space.  NIC card.  Price can range from several hundred dollars to thousands of dollars.
Web Server Hardware Myths vs. Realities Myth Reality
Important Notes about Web Server Hardware Web Servers need fast disk access and a lot of RAM to handle high-volumes of traffic. Not unusual to see web servers with 1GB of RAM and 10,000RPM hard drives. Processor speed and performance becomes very important when delivering dynamic content via custom web application. High-end PCs can typically handle 100K+ website visits per day.
Hosting Your Server: Operating System Options Commercial Versions of Unix (i.e. Solaris HP-UX, AIX, MacOS X, IRIX).  Free Versions of Unix (i.e. Linux, FreeBSD).  Microsoft Windows (9x, NT, XP, Windows 2000).  Windows vs. Unix – raises issues of easy of use, stability, scalability, open source, and pricing.  UNIX platforms generally considered more reliable, scaleable and cost-effective.
Hosting Your Server:  Web Server Software Options The primary focus of this class will be installing and configuration the web server software – the software the turns an ordinary computer into a computer that can host and serve content on the World Wide Web. Web server software is often referred to as: The Web server The web daemon The “httpd”
Hosting Your Server: Web Server Software Options According to the Oct. 2002 Netcraft Web Server Survey ( http:// www.netcraft.com ), four Web server software distributions support over 90% of all Websites on the Internet:   Apache - 65%  Microsoft Internet Information Server - 25%  Zeus Web Server - 1.4% iPlanet – 1.3%
Web Server Software Options: Apache “ The standard” for UNIX web servers.. The most popular web server.  Considered to be the most secure, stable and robust server platform. Originally based on NCSA httpd code.  Can be installed under most Unix variants and Windows.  Binary versions available for many operating systems.  Uses file-based configuration, although some GUI tools are also available.
Introduction to Apache, con’t. Unix versions very stable. Windows version less mature, but becoming more stable. Apache 2.0 is released. Very Fast and uses resources efficiently.  Freely distributed source code. Can be modified for commercial or non-commercial use.  Price: Free.  Developed by the Apache software foundation.  See  http://www.apache.org  for more information.
Web Server Software Options: SunOne/iPlanet/Netscape Server Now officially called the SunOne Web Server Originally developed as Netscape Server, then distributed by partnership between Sun/Netscape, now owned and supported by Sun. Server packages: iPlanet/Netscape Enterprise Server, Netscape Fast-Track Server.  Runs under Windows NT, Solaris, HP-UX, Digital Unix, AIX, Linux.
Netscape/SunOne Server, con’t. Uses Web-based administration.  Can be resource intensive.  Lost major portion of market over last 5 years. Price: $1495 per CPU http://wwws.sun.com/software/products/web_srvr/home_web_srvr.html  for more information.
Web Server Software Options: Microsoft Internet Information Server (IIS) Most popular for Win NT and 2000-based web servers. Version 4 runs on Windows NT Server.  IIS version 5 runs on Windows 2000 Server (and XP, but used for development purposes only).  GUI-based administration. Web-based administration available as well.  May not scale well.
Microsoft IIS, con’t. Increasing concerns over its security. Source code not available. Extendable through Microsoft’s Internet Server API (ISAPI).  Price: Free with NT Server 4.0 and Windows 2000 Server See  http://www.microsoft.com/windows2000/technologies/web/default.asp  for more information.
How the Internet Works: Networking Basics For a Web server to be useful needs to be attached to a network.  Minimum requirements for a computer network – at least two computers that have a media and a method of communicating.  All Internet applications use TCP/IP (Transmission Control Protocol/Internet Protocol) for low-level communications.
Networking Basics: TCP/IP TCP/IP is actually a combination of 2 protocols: A transport layer protocol called the Transmission Control Protocol (TCP) A network protocol called the Internet Protocol (IP)
Networking Basics: IP Addresses TCP/IP uses IP address to identify different devices. Every computer on the Internet must have a least one unique IP address.  IPv4: IP address are four 8-bit numbers separated by dots: 165.230.30.68  Usually divided in three parts:  165.230 is one of Rutgers’ networks – e.g. no one else has addresses starting with 165.230 30 is the subnet portion of the address 68 is the particular node, or host portion of the address Division not necessarily on octet boundary.
TCP/IP:  Two Friends, Working Together IP - An IP address represents a machine’s identity on the internet and tells other machines how to get to it – similar to your street address (e.g. 123 Main Street, Anytown, USA).  TCP is a mechanism used to ensure that anything sent to a specific IP address makes it there in one piece. – similar to the Post Office. Together, TCP/IP assures that anything sent to a server on the Internet is delivered to the right place in one complete piece.
Networking Basics: IP Addresses IP addresses no longer being distributed by classes – blocks are distributed to ISPs on an as-needed basis and must be justified. IP addresses are hard to come by. How do you get them?   Your ISP received an “address space” from the ARIN (http://www.arin.org)  You receive IP addresses from your ISP.
Networking Basics: Tools Network interfaces need to be assigned IP addresses.  Interfaces can be configured using  ifconfig  command on UNIX machines.  Type  ifconfig –a  to view current configuration settings. Additional tools for network monitoring:  ping, traceroute, tcpdump, netstat, arp, snoop .
Networking Basics: DNS IP addresses are usually paired with more human-friendly names The system that contains the IP Address-to-Hostname pairing is called the  Domain Name System  (DNS).   internet.rutgers.edu Hostname Organization Top-level domain Other top-level domains include .com, .gov, .org, etc.  There are also country-specific domains like .uk, .ca, .jp, etc.
Networking Basics: DNS, con’t. Domain name information is maintained through a distributed database of host name/ IP address pairing.  The Network Information Center (NIC) manages the top-level domains and maintains a database of registered name servers for all domain names.  Host name assignments maintained through zone files on primary and secondary DNS servers controlled by the organization that owns the domain (or their ISP).
Networking Basics: DNS, con’t. Network Solutions (previously the InterNic) registers domain names – See  http://www.networksolutions.com .  Other registrars include Register.com Costs range from $20 to $50 per year. ISP’s often offer domain name registration as part of other packages (such as web hosting packages).  Need to register a primary and secondary domain name servers for your domain and arrange to have zone files created on DNS servers.  Your ISP will typically configure this for you.
DNS Overview:  If Computers on the Internet Could Talk…
Networking Basics: DNS Tools There are several tools for for monitoring DNS information: whois  – tells you the owner and primary DNS servers associated with a domain (e.g.  whois yahoo.com ).  Also available via web browser at  www.networksolutions.com .  nslookup  and  host  – tell you IP address information for a particular hostname on the internet (e.g.  nslookup www.yahoo.com  or  host www.rutgers.edu )
DNS Exercise What are IP addresses of the DNS servers that contain information about rutgers.edu? What are the IP address of: www.retaildecisions.com abusaday.admin.cju.com www.linux.org
Networking Basics: Ports Servers tend to run a number of services. A single NIC can be used to provide multiple services through  ports .  Network server software “listens” on specific ports. Clients contact server by specifying an IP address and a connection port.  The port is the “identifier” that tells a server what application a piece of network traffic is destined for. Common services and port numbers:  smtp 25, ftp 21, telnet 23, http/web 80, https/ssl 443 A list of services and ports is contained in the /etc/services file on UNIX systems. Ports below 1024 are reserved for special system services and can only be used by programs started by root (the system administrator on a UNIX system).
Uniform Resource Locator (URL) URL:  a fancy way of saying “web site address” Anatomy of a URL: http://internet.rutgers.edu:80/ITI520/index.html Protocol  Hostname  Port Number   Path To File
Unix Tools and Commands File Editors: vi, emacs, pico  File system navigation: cd  File management: mv, rm, mkdir,rmdir, ls, chmod, ln  Archiving and compression: tar, gzip ***Process management: ps, kill  Man pages available for all these commands, e.g “man rmdir”
UNIX Process Management UNIX Processes are managed using the  ps  and  kill  commands ps  is used to list processes running on the system kill  is used to kill and restart processes running on the system Every time you start a new program (pico, vi, bash, etc.) a process is created and  you  are the owner of that process. Each process is assigned a unique Process ID (PID) on the system
Process Management Exercises You can type  ps –aux  to see all the processes running on a system.  This will list the process owner, process ID (PID) and the command being run. You can  kill  any PID, as long as you are the owner of the process. ps –aux | grep username  will show all the processes your are currently running
Process Management Exercises, con’t. Open up a new UNIX terminal window and type  vi foo.txt . This will create a new process on the system that you own. Open a second terminal window on the same UNIX system.  Locate the process ID for your vi session and kill it.  What happens?
Reading for Next Week… As assigned in class…

More Related Content

PPT
Apache Web Server Setup 4
PPT
Apache Web Server Setup 3
PPT
Apache Web Server Setup 2
PPTX
Web server installation_configuration_apache
PDF
Apache Server Tutorial
PPT
Apache web server installation/configuration, Virtual Hosting
PPT
Apache ppt
PDF
Apache Web server Complete Guide
Apache Web Server Setup 4
Apache Web Server Setup 3
Apache Web Server Setup 2
Web server installation_configuration_apache
Apache Server Tutorial
Apache web server installation/configuration, Virtual Hosting
Apache ppt
Apache Web server Complete Guide

What's hot (20)

PPT
Apache
PDF
Apache web server tutorial for linux
PPT
Apache Ppt
PDF
Configuring the Apache Web Server
PDF
Apache Tutorial
PPTX
Apache web server
PPT
Apache Presentation
PPTX
Apache server configuration & optimization
PPTX
Apache web server
PDF
APACHE WEB SERVER FOR LINUX
PPT
Apache Web Server Architecture Chaitanya Kulkarni
PPT
5-WebServers.ppt
PPT
Web Server(Apache),
PPTX
Apache web server
PPTX
Apache web service
PPT
Apache HTTP Server
PPT
Apache installation and configurations
PPT
Linux Webserver Installation Command and GUI.ppt
ODP
Apache ppt
PDF
Using aphace-as-proxy-server
Apache
Apache web server tutorial for linux
Apache Ppt
Configuring the Apache Web Server
Apache Tutorial
Apache web server
Apache Presentation
Apache server configuration & optimization
Apache web server
APACHE WEB SERVER FOR LINUX
Apache Web Server Architecture Chaitanya Kulkarni
5-WebServers.ppt
Web Server(Apache),
Apache web server
Apache web service
Apache HTTP Server
Apache installation and configurations
Linux Webserver Installation Command and GUI.ppt
Apache ppt
Using aphace-as-proxy-server
Ad

Similar to Apache Web Server Setup 1 (20)

PPT
Web Technology – Web Server Setup : Chris Uriarte
PDF
web hosting
PDF
Web hosting presentations by hostindia.net
PPT
Web servers – features, installation and configuration
PPTX
WP Chap 1 & 2.pptx
PPTX
Web server hardware and software
PPSX
Web server
PPT
AI using Web Design
PPTX
05.m3 cms list-ofwebserver
PPT
PPT
Web Server Hardware and Software
PPT
Web Fendamentals
PPTX
Web-Server & It's Architecture.pptx
PPT
Web Servers, Browsers, Server - Browser Interaction, Web Surfing
ODP
web server
PPT
webhostlecture (1).ppt
PPT
webhostlecture (1).ppt
PPTX
Web Browsers.pptx
PPTX
Web application development ( basics )
PPTX
Lecture 1- Introduction to Computers and the Internet.pptx
Web Technology – Web Server Setup : Chris Uriarte
web hosting
Web hosting presentations by hostindia.net
Web servers – features, installation and configuration
WP Chap 1 & 2.pptx
Web server hardware and software
Web server
AI using Web Design
05.m3 cms list-ofwebserver
Web Server Hardware and Software
Web Fendamentals
Web-Server & It's Architecture.pptx
Web Servers, Browsers, Server - Browser Interaction, Web Surfing
web server
webhostlecture (1).ppt
webhostlecture (1).ppt
Web Browsers.pptx
Web application development ( basics )
Lecture 1- Introduction to Computers and the Internet.pptx
Ad

More from Information Technology (20)

PDF
Sql Server Security Best Practices
PPT
PPT
SQL 2005 Disk IO Performance
PPT
PPT
Review of SQL
PPT
Sql 2005 high availability
PPT
IIS 7: The Administrator’s Guide
PPT
MOSS 2007 Deployment Fundamentals -Part2
PPT
MOSS 2007 Deployment Fundamentals -Part1
PPT
Clustering and High Availability
PDF
F5 beyond load balancer (nov 2009)
PPT
WSS 3.0 & SharePoint 2007
PPT
SharePoint Topology
PDF
Sharepoint Deployments
PPT
Microsoft Clustering
PDF
Scalable Internet Servers and Load Balancing
PPT
PPT
Migration from ASP to ASP.NET
Sql Server Security Best Practices
SQL 2005 Disk IO Performance
Review of SQL
Sql 2005 high availability
IIS 7: The Administrator’s Guide
MOSS 2007 Deployment Fundamentals -Part2
MOSS 2007 Deployment Fundamentals -Part1
Clustering and High Availability
F5 beyond load balancer (nov 2009)
WSS 3.0 & SharePoint 2007
SharePoint Topology
Sharepoint Deployments
Microsoft Clustering
Scalable Internet Servers and Load Balancing
Migration from ASP to ASP.NET

Recently uploaded (20)

PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Tartificialntelligence_presentation.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
Spectroscopy.pptx food analysis technology
PPTX
A Presentation on Artificial Intelligence
PPTX
Machine Learning_overview_presentation.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
1. Introduction to Computer Programming.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Big Data Technologies - Introduction.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
“AI and Expert System Decision Support & Business Intelligence Systems”
MYSQL Presentation for SQL database connectivity
Tartificialntelligence_presentation.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
A comparative analysis of optical character recognition models for extracting...
Spectral efficient network and resource selection model in 5G networks
MIND Revenue Release Quarter 2 2025 Press Release
Building Integrated photovoltaic BIPV_UPV.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Spectroscopy.pptx food analysis technology
A Presentation on Artificial Intelligence
Machine Learning_overview_presentation.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm
1. Introduction to Computer Programming.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Big Data Technologies - Introduction.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
20250228 LYD VKU AI Blended-Learning.pptx

Apache Web Server Setup 1

  • 1. WMB-520: Web Technology Web Server Setup Rutgers University Center for Applied Computing Technology Instructor: Christopher Uriarte Meeting 1: Introduction to the Internet, World Wide Web and Web Servers
  • 2. Course Overview and Goals This course will teach you how to install, configure, and administer a Web server that runs on a Unix system and can be used to deliver dynamic content. The course objectives will be achieved through a combination of lectures, demonstrations, and hands-on exercises.
  • 3. About Your Instructor… Chris Uriarte – [email protected]. Feel free to contact me anytime via email. Homepage for this class linked from: http://www.cju.com/classes Contains all slides, notes and misc. links and resources.
  • 4. What This Course Is and Is Not The purpose of the course is to teach you the concepts behind serving content on the web and how to run and administer web server software on a UNIX system. This means you will be learning how to use tools to deliver content for the World Wide Web, not to create content. Rutgers offers other courses designed to teach you how to create content for the World Wide Web (HTML design, Perl, etc.)
  • 5. Prerequisites Familiarity with a Web browser such as Netscape or Internet Explorer. You MUST have user-level experience with UNIX and must be familiar with the use of a UNIX text editor like vi, emacs and pico
  • 6. Class Schedule Week 1 : Introduction to the Internet, the World Wide Web, and Web Servers Week 2 : Installing and Configuring the Apache Web Server Week 3: Extending Apache: Advanced Topics Week 4: More advanced configuration and Web Security Note: classes 2-4 are hands-on classes.
  • 7. Course Resources Textbook: Apache: The Definitive Guide by Laurie, Laurie and Denn (O’Reilly Press, 1999). User account on UNIX server blender.rutgers.edu . (Remote access available via SSH) Slides and resources on class hompage, linked from http://www.cju.com/classes/
  • 8. How does the World Wide Web Work? Works on a client/server model: The Web server software is the server component, the Web browser is the client component. Purpose of the Web server is to provide documents to clients. Web servers, Web browsers, and the information that is shared between them through the Hypertext Transfer Protocol (HTTP) protocol make up the World Wide Web.
  • 9. History of the World Wide Web Grew out of the Internet, a network of networks that began in the early 1970’s and was used to support a variety of services (including telnet, ftp, Usenet, email, and gopher). Most of these services communicated via via TCP/IP (Transmission Control Protocol/Internet Protocol). In 1989, Tim Berners-Lee at CERN developed a new system to simplify document distribution and to allow documents to be linked together. Called the “World Wide Web.”
  • 10. Web History, con’t. In 1993, the National Center for Supercomputing Applications (NSCA) released to the public the NCSA Web Server software and a GUI Web browser, called Mosaic. Quickly became popular. Mosaic became Netscape – first major web browser distribution.
  • 11. The Web Today The Web has evolved greatly since it was first implemented, but it’s overall architecture has remained generally the same. Still fueled by three major components: A network (typically the Internet, but can also include local networks or Intranets), a client component (a web browser) and a server software component (what we’re going to learn about in this class).
  • 12. Webmasters, Sys-Admins and Developers, Oh My! There are a number of key roles that are necessary to support a web infrastructure. Providing end-to-end web services requires knowledge about systems, network, software, graphical design, programming and much more. There are many different
  • 13. Roles in the Web World: Web Designers – Create graphical elements and determine layout of Website. Content Providers- Create and edit HTML documents. Web Developers – Write web applications using programming languages such as Java, JavaScript, ASP, PHP and Perl other used to deliver dynamic content.
  • 14. Roles in the Web World (con’t): System Administrators – Responsible for maintaining the Web server software and often the operating system and hardware where the Web server is installed. Network Administrators – Responsible for the design and maintanance of network components used to deliver web content. For most organizations, these responsibilities tend to be split over multiple job positions except for very small and simple Web sites.
  • 15. Hosting A Website: The Planning Phase There are a number of key questions an organization or individual must ask when planning to deploy a website: How and where will you host it? What kind of hardware will you use? What kind of Operating System will the hardware run? What Web server software will you use? What domain name will your site use? Answers to above questions usually determined by budget, staffing, and existing infrastructure of your organization.
  • 16. Hosting Your Website: Options Use a Free Page Site – For personal use, limited space and tools, typically adds advertisements. (examples: Yahoo, Tripod, Xoom, etc.). Limitation on amount of traffic your site is allowed. Generally not an option for business use. Personal Page Site – For personal use, usually included with an ISP (about $20 per month), includes small amoung of disk space, no or limited access to server-side technologies for delivering dynamic content, generally uses your ISP’s Internet domain name. Limitation on amount of traffic your site is allowed. (Website URL usually looks something like: http://www.yourisp.com/~yourusername ). Might be an option for very small businesses. Under both models, ISP owns and manages the server, network and web server software. You simply provide the content.
  • 17. Hosting Your Website, con’t. Virtual Hosting – Most popular web hosting option todays. Suitable for business or personal use. Using this model, an ISP uses one machine to host many different websites (sometimes called “shared hosting”). You can use your own Internet domain name ( http://www.yourdomain.com ). These typically provide a wide range of tools for building more complex Websites. ISP owns and manages the server, network and web server software. You simply provide the content. Overall cost based on disk usage and website traffic, ranges from $20 to several thousands of dollars a month. Now generally available through all ISPs and specialized Hosting-only providers such as Highway Technologies ( http://www.hway.net ) and YourDomainHost ( http:// www.yourdomainhost.com )
  • 18. Hosting Your Website, con’t. Dedicated Server Services – For business use, ISP owns and the hardware and network. Your organization typically has the option of managing and configuring the server. You provide the content. Your organization has exclusive access to the server. Price based on the type of server you require and the amount of traffic your site uses. Co-Location Services (“co-lo”) – For business use - Your organization owns and manages the hardware, software and content. ISP provides you with space to place the server and the network connectivity. Price based on the amount of space your servers require, the power requirements of the servers and the amount of traffic.
  • 19. Hosting Your Website, con’t. Managing your Own Web Server Network (“in-house” web hosting) – You provision, configure and manage the network connection, hardware, software and provide the content. Most flexible option – you have complete control. Cost can be very high or very low – depending on the business need.
  • 20. Hosting Your Web Server: Do It Yourself Networking Options For an Intranet Server– Need a LAN (local area network). Does not require an Internet connection. For an Internet Server – Need a dedicated Internet connection. Internet Connectivity Options: POTS (up to 56Kbps) – not practical for web use anymore. ISDN (128Kbps) – not practical for web use either (costly, slow) Cable (512Kbps – 10Mbps) DSL (128kps – 1.54 Mbps+) T-1 (up to 1.54Mbps) – full, fractional, or burstable T-3 (up to 45 Mbps) – full, fractional, or burstable
  • 21. Finding an ISP Some ISPs specialize in web hosting and provide all the service described earlier (shared hosting, dedicated server, co-location, etc.). Other ISPs specialize only in commercial Internet access (AOL, Earthlink, etc.). They may provide free personal website space. Check The List – http://thelist.com , for a comprehensive list ISPs and their services.
  • 22. Hosting Your Server: Hardware Options There are a number of things to consider when choosing the hardware platform for your website: Need to select a machine architecture (i.e Intel Compatible PC, Sun, Macintosh) – Typically dictated by your Operating System of choice Processor speed and number of processors. RAM and Disk Space. NIC card. Price can range from several hundred dollars to thousands of dollars.
  • 23. Web Server Hardware Myths vs. Realities Myth Reality
  • 24. Important Notes about Web Server Hardware Web Servers need fast disk access and a lot of RAM to handle high-volumes of traffic. Not unusual to see web servers with 1GB of RAM and 10,000RPM hard drives. Processor speed and performance becomes very important when delivering dynamic content via custom web application. High-end PCs can typically handle 100K+ website visits per day.
  • 25. Hosting Your Server: Operating System Options Commercial Versions of Unix (i.e. Solaris HP-UX, AIX, MacOS X, IRIX). Free Versions of Unix (i.e. Linux, FreeBSD). Microsoft Windows (9x, NT, XP, Windows 2000). Windows vs. Unix – raises issues of easy of use, stability, scalability, open source, and pricing. UNIX platforms generally considered more reliable, scaleable and cost-effective.
  • 26. Hosting Your Server: Web Server Software Options The primary focus of this class will be installing and configuration the web server software – the software the turns an ordinary computer into a computer that can host and serve content on the World Wide Web. Web server software is often referred to as: The Web server The web daemon The “httpd”
  • 27. Hosting Your Server: Web Server Software Options According to the Oct. 2002 Netcraft Web Server Survey ( http:// www.netcraft.com ), four Web server software distributions support over 90% of all Websites on the Internet: Apache - 65% Microsoft Internet Information Server - 25% Zeus Web Server - 1.4% iPlanet – 1.3%
  • 28. Web Server Software Options: Apache “ The standard” for UNIX web servers.. The most popular web server. Considered to be the most secure, stable and robust server platform. Originally based on NCSA httpd code. Can be installed under most Unix variants and Windows. Binary versions available for many operating systems. Uses file-based configuration, although some GUI tools are also available.
  • 29. Introduction to Apache, con’t. Unix versions very stable. Windows version less mature, but becoming more stable. Apache 2.0 is released. Very Fast and uses resources efficiently. Freely distributed source code. Can be modified for commercial or non-commercial use. Price: Free. Developed by the Apache software foundation. See http://www.apache.org for more information.
  • 30. Web Server Software Options: SunOne/iPlanet/Netscape Server Now officially called the SunOne Web Server Originally developed as Netscape Server, then distributed by partnership between Sun/Netscape, now owned and supported by Sun. Server packages: iPlanet/Netscape Enterprise Server, Netscape Fast-Track Server. Runs under Windows NT, Solaris, HP-UX, Digital Unix, AIX, Linux.
  • 31. Netscape/SunOne Server, con’t. Uses Web-based administration. Can be resource intensive. Lost major portion of market over last 5 years. Price: $1495 per CPU http://wwws.sun.com/software/products/web_srvr/home_web_srvr.html for more information.
  • 32. Web Server Software Options: Microsoft Internet Information Server (IIS) Most popular for Win NT and 2000-based web servers. Version 4 runs on Windows NT Server. IIS version 5 runs on Windows 2000 Server (and XP, but used for development purposes only). GUI-based administration. Web-based administration available as well. May not scale well.
  • 33. Microsoft IIS, con’t. Increasing concerns over its security. Source code not available. Extendable through Microsoft’s Internet Server API (ISAPI). Price: Free with NT Server 4.0 and Windows 2000 Server See http://www.microsoft.com/windows2000/technologies/web/default.asp for more information.
  • 34. How the Internet Works: Networking Basics For a Web server to be useful needs to be attached to a network. Minimum requirements for a computer network – at least two computers that have a media and a method of communicating. All Internet applications use TCP/IP (Transmission Control Protocol/Internet Protocol) for low-level communications.
  • 35. Networking Basics: TCP/IP TCP/IP is actually a combination of 2 protocols: A transport layer protocol called the Transmission Control Protocol (TCP) A network protocol called the Internet Protocol (IP)
  • 36. Networking Basics: IP Addresses TCP/IP uses IP address to identify different devices. Every computer on the Internet must have a least one unique IP address. IPv4: IP address are four 8-bit numbers separated by dots: 165.230.30.68 Usually divided in three parts: 165.230 is one of Rutgers’ networks – e.g. no one else has addresses starting with 165.230 30 is the subnet portion of the address 68 is the particular node, or host portion of the address Division not necessarily on octet boundary.
  • 37. TCP/IP: Two Friends, Working Together IP - An IP address represents a machine’s identity on the internet and tells other machines how to get to it – similar to your street address (e.g. 123 Main Street, Anytown, USA). TCP is a mechanism used to ensure that anything sent to a specific IP address makes it there in one piece. – similar to the Post Office. Together, TCP/IP assures that anything sent to a server on the Internet is delivered to the right place in one complete piece.
  • 38. Networking Basics: IP Addresses IP addresses no longer being distributed by classes – blocks are distributed to ISPs on an as-needed basis and must be justified. IP addresses are hard to come by. How do you get them? Your ISP received an “address space” from the ARIN (http://www.arin.org) You receive IP addresses from your ISP.
  • 39. Networking Basics: Tools Network interfaces need to be assigned IP addresses. Interfaces can be configured using ifconfig command on UNIX machines. Type ifconfig –a to view current configuration settings. Additional tools for network monitoring: ping, traceroute, tcpdump, netstat, arp, snoop .
  • 40. Networking Basics: DNS IP addresses are usually paired with more human-friendly names The system that contains the IP Address-to-Hostname pairing is called the Domain Name System (DNS). internet.rutgers.edu Hostname Organization Top-level domain Other top-level domains include .com, .gov, .org, etc. There are also country-specific domains like .uk, .ca, .jp, etc.
  • 41. Networking Basics: DNS, con’t. Domain name information is maintained through a distributed database of host name/ IP address pairing. The Network Information Center (NIC) manages the top-level domains and maintains a database of registered name servers for all domain names. Host name assignments maintained through zone files on primary and secondary DNS servers controlled by the organization that owns the domain (or their ISP).
  • 42. Networking Basics: DNS, con’t. Network Solutions (previously the InterNic) registers domain names – See http://www.networksolutions.com . Other registrars include Register.com Costs range from $20 to $50 per year. ISP’s often offer domain name registration as part of other packages (such as web hosting packages). Need to register a primary and secondary domain name servers for your domain and arrange to have zone files created on DNS servers. Your ISP will typically configure this for you.
  • 43. DNS Overview: If Computers on the Internet Could Talk…
  • 44. Networking Basics: DNS Tools There are several tools for for monitoring DNS information: whois – tells you the owner and primary DNS servers associated with a domain (e.g. whois yahoo.com ). Also available via web browser at www.networksolutions.com . nslookup and host – tell you IP address information for a particular hostname on the internet (e.g. nslookup www.yahoo.com or host www.rutgers.edu )
  • 45. DNS Exercise What are IP addresses of the DNS servers that contain information about rutgers.edu? What are the IP address of: www.retaildecisions.com abusaday.admin.cju.com www.linux.org
  • 46. Networking Basics: Ports Servers tend to run a number of services. A single NIC can be used to provide multiple services through ports . Network server software “listens” on specific ports. Clients contact server by specifying an IP address and a connection port. The port is the “identifier” that tells a server what application a piece of network traffic is destined for. Common services and port numbers: smtp 25, ftp 21, telnet 23, http/web 80, https/ssl 443 A list of services and ports is contained in the /etc/services file on UNIX systems. Ports below 1024 are reserved for special system services and can only be used by programs started by root (the system administrator on a UNIX system).
  • 47. Uniform Resource Locator (URL) URL: a fancy way of saying “web site address” Anatomy of a URL: http://internet.rutgers.edu:80/ITI520/index.html Protocol Hostname Port Number Path To File
  • 48. Unix Tools and Commands File Editors: vi, emacs, pico File system navigation: cd File management: mv, rm, mkdir,rmdir, ls, chmod, ln Archiving and compression: tar, gzip ***Process management: ps, kill Man pages available for all these commands, e.g “man rmdir”
  • 49. UNIX Process Management UNIX Processes are managed using the ps and kill commands ps is used to list processes running on the system kill is used to kill and restart processes running on the system Every time you start a new program (pico, vi, bash, etc.) a process is created and you are the owner of that process. Each process is assigned a unique Process ID (PID) on the system
  • 50. Process Management Exercises You can type ps –aux to see all the processes running on a system. This will list the process owner, process ID (PID) and the command being run. You can kill any PID, as long as you are the owner of the process. ps –aux | grep username will show all the processes your are currently running
  • 51. Process Management Exercises, con’t. Open up a new UNIX terminal window and type vi foo.txt . This will create a new process on the system that you own. Open a second terminal window on the same UNIX system. Locate the process ID for your vi session and kill it. What happens?
  • 52. Reading for Next Week… As assigned in class…