Presentation
Transcription
Presentation
VoIP Over the Internet: Is Toll Quality Achievable? Mansour Karam, Technical Lead SCV Communications Society May 12, 2004 © 2004 RouteScience Technologies, Inc. 1 Agenda • • • • • • Introduction VoIP versus “VoIP over the Internet” Challenges for VoIP over the Internet Technological advances that address today’s challenges An overview of adaptive networking technology Case studies © 2004 RouteScience Technologies, Inc. Agenda 2 Introduction: Migration to VoIP is compelling • As VoIP vendors will tell you, migration to a converged voice/data network is compelling for many reasons – More effective communications – Reduction in CapEx and OpEx – Enhanced flexibility and resiliency (in theory) • Some pointers – http://telecomreseller.com/avayaextra/ – http://www.nwfusion.com/columnists/2002/0916taylor.html – http://www1.avaya.com/enterprise/news/docs/lp/ccs.html?c=SIP&n =SIP_AvCom_ThoughtLdrship&t=internal – http://www.cisco.com/warp/public/cc/so/neso/vvda/iptl/msipt_bc.pdf • Hence the spotlight on VoIP © 2004 RouteScience Technologies, Inc. Introduction 3 Introduction: VoIP versus VoIP over the Internet • • Today’s VoIP deployments work great over local area networks • However, VoIP deployments do not work so well over the Internet, because: – Internet is a “best effort” infrastructure… – Internet infrastructure shared across a large number of competing Today’s VoIP deployments work reasonably well over dedicated (expensive) private wide area networks – Frame relay, ATM, leased lines applications with widely different characteristics – Highly demanding applications (such as Voice) experience quality and availability problems • Regardless of WAN fabric, availability is orders of magnitude away from 99.999% or “5 nines” © 2004 RouteScience Technologies, Inc. Introduction 4 Introduction: adaptive networking • Enterprises generally engineer into their network some level of redundancy • • In particular, more than one path is commonly available Adaptive networking leverages inherent redundancy, through: – Monitoring of available paths – Assessment of paths according to a set of criteria – Dynamic route adjustments to steer traffic through the path that makes the best business sense at any given time • Adaptive networking is Air Traffic Control for your WAN © 2004 RouteScience Technologies, Inc. Introduction 5 What is the public Internet? • Collection of networks that have little incentive to work together Internet © 2004 RouteScience Technologies, Inc. VoIP vs “VoIP on the Internet” 6 What is it like to use the public Internet? • • • • • • Switches Routers Firewalls VPN gateways Load balancers Packet shapers • • • • Maintenance windows Software anomalies Feature set compatibilities Technology in constant flex © 2004 RouteScience Technologies, Inc. Internet VoIP vs “VoIP on the Internet” 7 Data collected between RouteScience POP locations, June 2001 AND EWR SJC THR ASH Source: A. Markopoulou, F. Tobagi, M. Karam, "Assessing the quality of Voice Communications over Internet Backbones", IEEE Transactions on Networking, October 2003 © 2004 RouteScience Technologies, Inc. “VoIP on the Internet” data 8 Delay and loss characteristics • Delay • East coast: 3.25-11.8 ms • Coast-Colorado: 28.3-77.8 ms • Coast-to-coast: 31.3-47.2 ms – Delay variability: Delay in ms – Propagation delay: • Pattern: mainly spikes • During the day • Loss Time – Mainly outages – reliability problems • Happen at least once per day for 6 out of 7 providers • Usually preceding changes in the propagation delay Source: A. Markopoulou, F. Tobagi, M. Karam, "Assessing the quality of Voice Communications over Internet Backbones", IEEE Transactions on Networking, October 2003 © 2004 RouteScience Technologies, Inc. “VoIP on the Internet” data 9 Converting delay and loss into MOS Speech Transmission Quality (user satisfaction.) Desirable Acceptable Best (very satisfied) High (satisfied) Medium (some users dissatisfied) Low (many users dissatisfied) Poor (nearly all dissatisfied) Mean Opinion Score (MOS) 4.5 4.3 4.0 3.6 3.1 2.6 Not recommended 1 Reference: ITU-T G.107/Annex B © 2004 RouteScience Technologies, Inc. “VoIP on the Internet” data 10 Percentage of calls with MOS ≤ x Results from joint Stanford/Routescience study Threshold of acceptable quality Availability 97% Worst MOS for call MOS at the end of call 63% of the calls exhibit a worst period having unacceptable quality 3% of the calls have unacceptable quality Source: A. Markopoulou, F. Tobagi, M. Karam, "Assessing the quality of Voice Communications over Internet Backbones", IEEE Transactions on Networking, October 2003 © 2004 RouteScience Technologies, Inc. “VoIP on the Internet” data 11 Results from joint Stanford/Routescience study 12% of the calls exhibit a worst period having unacceptable quality Percentage of calls with MOS ≤ x 100 Worst MOS for call MOS at the end of call 10 2% of the calls have unacceptable quality Threshold of acceptable quality 1 Availability 1 2 MOS 3 4 98% Source: A. Markopoulou, F. Tobagi, M. Karam, "Assessing the quality of Voice Communications over Internet Backbones", IEEE Transactions on Networking, October 2003 © 2004 RouteScience Technologies, Inc. “VoIP on the Internet” data 12 Results from joint Stanford/Routescience study “… Backbones networks are over-provisioned and thus expected not to be the bottleneck on the path of a flow. Although this might be the case for data traffic, this is not always the case for VoIP traffic. We observed poor VoIP performance on a large number of ISP backbone networks under favorable end-system configurations” Source: A. Markopoulou, F. Tobagi, M. Karam, "Assessing the quality of Voice Communications over Internet Backbones", IEEE Transactions on Networking, October 2003 © 2004 RouteScience Technologies, Inc. “VoIP on the Internet” data 13 Bad minutes per month Enterprise customer case study, January 2003 345 bad minutes Brownouts: 254 minutes (74%) Blackouts: 91 minutes (26%) 26 bad seconds Internet: PSTN norm: 99.2% availability 99.999% availability © 2004 RouteScience Technologies, Inc. “VoIP on the Internet” data 14 VoIP over a private network • Major VoIP equipment vendors recommend the use of private networks: – Owned/leased networks – Frame Relay – ATM © 2004 RouteScience Technologies, Inc. Private Networks 15 Frame Relay case study: Online financial services firm VoIP over Frame Relay Headquarters ISP 1 IP/PBX IP/PBX Headquarters Eastern Data Center OC-3 OC-3 OC-3 ISP 1 Internet ISP 2 ISP 2 OC-3 IP/PBX DS-3 DS-3 Frame Relay DS3 End-to-end measurements collected for 11 days © 2004 RouteScience Technologies, Inc. Private Networks 16 Example performance problems over Frame Relay and Internet Delay spike High packet loss Link failure • Configuration Bad Minutes Reliability (%) Frame Relay 5.4 99.966 Internet 126.2 99.203 The resulting end-to-end system still does not deliver Toll Quality voice © 2004 RouteScience Technologies, Inc. Private Networks 17 Cost of private network • • • Private links are costly Private link costs increases with distance Inter-continental private link costs are very high © 2004 RouteScience Technologies, Inc. Private Networks 18 Technological landscape Technology category Adaptive networking Technology function Sidestep “brownouts” QoS Example companies RouteScience Router vendors Scheduling techniques Packet sequencing Sitara Cetacean Packet shaping Metering lights Compression Stuffing more in a packet MPLS Traffic Engineering Buffer Management schemes Filtering Packeteer Peribit Router vendors Caspian Router vendors Other… © 2004 RouteScience Technologies, Inc. Technological Advances 19 QoS example: DiffServ combined with sophisticated queue scheduling • DiffServ ToS marking allows traffic to be categorized as Voice or Data. • Scheduling gives voice traffic priority access to the resources using your favorite scheduling technique Voice Data traffic Scheduler © 2004 RouteScience Technologies, Inc. •Priority Queuing •WRR Technological Advances 20 Challenges of traditional QoS • QoS in IP networks largely remains an elusive goal, even though over-provisioning not economically viable in the long run • QoS challenges: – – – – Strict prioritization degrades under load Architecture scales, but QoS doesn’t How is provisioning done for various, different QoS requirements Hard translation from delay, jitter, loss requirements to classes of service – Requires cooperation across different ISP backbones • All the above challenges impede the actual implementation © 2004 RouteScience Technologies, Inc. Technological Advances 21 Challenges of traditional QoS “(…) The crucial issue is that we are trying to get deterministic performance for multiple classes of traffic. If the QoS story were just best effort and one premium class, then queuing mechanisms of today work (…) With multiple realtime classes—voice, video, machine-to-machine, telemetry and others—you will degrade back to best effort. It’s just too much to do the packet-by-packet routing and the queuing calculations and figure out where and in which queue to stick each packet. Forget it! It’s too complicated (…)” Peter Sevcik, Business Communications Review, September 2003 © 2004 RouteScience Technologies, Inc. Technological Advances 22 Alternative: packet sequencing • In effect, circuit switching, wherein different types of circuits can be created • How does it work? – Creates itineraries for various flows admitted to the network – Insures that the collection of itineraries meet a given schedule – A schedule is a collection of appointments, wherein an appointment consists of the deadline by which a packet of a given size is to be processed by a switch – Relies on admission control: Call only accepted if an itinerary that satisfies each router’s schedule is found © 2004 RouteScience Technologies, Inc. Technological Advances 23 Packet sequencing • Performance credentials – Sub 2-second call setup times – Has been tested to provide 99.999% (5 nines) in the lab • Challenges: – For technique to be effective, requires all network elements in the path to be capable of packet sequencing – Very expensive to deploy – Difficult to scale © 2004 RouteScience Technologies, Inc. Technological Advances 24 Applications succeed only when ALL of the infrastructure works Data Centers • Business objectives are dynamic – New applications, new partners, new policies • Applications, and users, are getting more demanding – VoIP, video conferencing • Applications are being stretched over longer distances • The wide area network is the key point of vulnerability – How do you avoid problems in fabric you don’t own? • “Brownouts” are sudden, and require instant response – How do you spot a brownout? © 2004 RouteScience Technologies, Inc. Network Infrastructure Users Adaptive networking 25 Virtualized infrastructure • Like the servers and the storage devices, the key is – Virtualization – Redundancy with intelligent “oversight” • Redundancy should also apply in the WAN: – Multiple paths – A combination of private and public links, architected as appropriate • Define policy for availability requirements – Manage the performance – cost tradeoff – Monitor / assess / adjust / notify © 2004 RouteScience Technologies, Inc. Adaptive networking 26 The key to adaptive networking Good/Bad Quality Metric • “Bad” means an application quality problem caused by the network • Application Delay Transport Delay Raw Latency, Loss, Jitter, etc • • “Star ratings” comparable across apps • Delay for a typical app transaction • Transport layer impact of low level scores • Low level measures Individual network-level metrics do not determine absolute quality What is “good” for one application type may be “bad” for another © 2004 RouteScience Technologies, Inc. Adaptive networking 27 Closing the loop with automated repair Application Quality Metrics • • • • Application needs User location User importance WAN status RouteScience © 2004 RouteScience Technologies, Inc. • • Assess alternate paths • • • React quickly Actively control network to: – Sidestep brownouts – Increase app performance – Reduce costs Maintain stability Validate effectiveness Adaptive networking 28 Managing the application through the fabric © 2004 RouteScience Technologies, Inc. 29 What adaptive networking can do for VoIP • • • • • Add a 9 to VoIP availability Eliminate 90% or more of bad minutes Eliminate network upgrades Provide WAN visibility Improve quality of 1-800 services to India © 2004 RouteScience Technologies, Inc. 30 MOS Case study: ISP problem during business hours Threshold of acceptable VoIP quality 7pm • • • Midnight 5am 10am 3pm EST All ISPs suffer unpredictable performance problems No single ISP can deliver sufficient quality for VoIP, 24x7x365 On Net is not always best © 2004 RouteScience Technologies, Inc. Case study 31 MOS Adaptive networking delivers sustainable voice quality Threshold of acceptable VoIP quality 7pm Midnight 5am 10am 3pm EST 7pm Midnight 5am 10am 3pm EST MOS Improvement 7pm © 2004 RouteScience Technologies, Inc. Threshold of acceptable VoIP quality Midnight 5am 10am 3pm Case study EST 32 Case study: Online financial services firm Eastern Data Center Headquarters ISP 1 OC-3 OC-3 OC-3 IP/PBX IP/PBX Headquarters ISP 1 Internet ISP 2 ISP 2 OC-3 IP/PBX DS-3 DS-3 Frame Relay DS3 End-to-end measurements collected for 11 days © 2004 RouteScience Technologies, Inc. Case study 33 Sample hour Adaptive networking-induced route changes: • From green to cyan • From cyan back to green Link failure Packet loss Performance problem Quality threshold 5 bad minutes for default routing 0.2 bad seconds for Optimized path © 2004 RouteScience Technologies, Inc. Case study 34 Example of delay spike RTT (ms) Delay spike Time (hour of day) © 2004 RouteScience Technologies, Inc. 35 RTT (ms) Example of delay fluctuations Adaptive networking induced route change from green to cyan Adaptive networking induced route change from cyan back to green Time (hour of day) © 2004 RouteScience Technologies, Inc. 36 Impact on VoIP availability Frame Relay • • • • Internet RS optimized Internet Configuration Bad Minutes Reliability (%) Frame Relay 5.4 99.966 Internet 126.2 99.203 Optimized Internet 14.7 99.907 Optimized path over Internet + Frame Relay 0.4 99.997 RS over Internet + Frame Relay Study comparing suitability of private, public and hybrid options Sidestep brownouts on “in-flight” calls Deliver a 10-fold increase in availability Reduce bad minutes up to 90% © 2004 RouteScience Technologies, Inc. Case study 37 General observations • All networks have quality failures – – – – – – Delay spikes Packet loss Link failures Delay fluctuations Large jitter due to layer-2 round-robin Congestion effects due to worms © 2004 RouteScience Technologies, Inc. Case study 38 General observations • • • Problems rarely occur in all networks at once • Adaptive networking effectively avoids performance problems by implementing route changes Bandwidth is clearly not the problem Performance problems result in multi-minute application outages affecting, for minutes, inter-PBX calls © 2004 RouteScience Technologies, Inc. Case study 39 Conclusion VoIP availability 99.999% Toll quality 99.99% 99.9% Availability gap Adaptive networking Underlying network 99% 90% 2001 • 2003 2005 2007 Adaptive networking fills the availability gap, allowing Toll Quality VoIP over the Internet to become a reality now © 2004 RouteScience Technologies, Inc. Conclusion 40