We Replaced an SSD with Storage Class Memory. Here is What We Learned.

On April 2, 2019 Intel Optane Persistent Memory became the first commercially available storage class memory (SCM). SCM is a storage device that sits on a memory bus; in contrast, traditional storage devices like SSD are attached to the PCIe bus. We were interested how SCM, being much closer to the We Replaced an SSD with Storage Class Memory. Here is What We Learned. Source: mongodb.com

SpaceX Starlink speeds revealed as beta users get downloads of 11 to 60Mbps

The same tests, conducted over the past two weeks, showed latencies or ping rates ranging from 31ms to 94ms. This isn’t a comprehensive study of Starlink speeds and latency, so it’s not clear whether this is what Internet users should expect once Starlink satellites are fully deployed and the service reaches commercial availability. We asked SpaceX several questions about the speed-test results yesterday and will update this article if we get answers.

Read more

Microsoft’s underwater data centre resurfaces after two years

Two years ago, Microsoft sank a data centre off the coast of Orkney in a wild experiment. That data centre has now been retrieved from the ocean floor, and Microsoft researchers are assessing how it has performed, and what they can learn from it about energy efficiency. Their first conclusion is that the cylinder packed with servers had a lower failure rate than a conventional data centre. That compares very well with a conventional data centre.

Read more

Cloud migration: What you need to know (and where to find it)

Migrating to the cloud for an enterprise that has been running workloads on premises for years can be very daunting. To be successful, a migration plan needs to factor in many different aspects relating to people, process and technology. If you are designing the migration, you need guidance and best practices to help steer you through this process. Building on our experience as solutions architects, we have put together a comprehensive set of documents for IT practitioners who are planning, designing, and implementing a migration to Google Cloud.

Read more

Amazon EKS now supports assigning EC2 security groups to Kubernetes pods

Amazon Elastic Kubernetes Service (EKS) customers can now leverage EC2 security groups to secure applications with varying network security requirements on shared cluster compute resources. Previously, all pods on a node shared the same security groups. While IAM roles for service accounts solves the pod level security challenge at the authentication layer, many organization’s compliance requirements also mandate network segmentation as an additional defense in depth step. Kubernetes network policies provide an option for controlling network traffic within the cluster, but do not support controlling access to AWS resources outside the cluster.

Read more

Ingress for Anthos—Multi-cluster Ingress and Global Service Load Balancing

Ingress for Anthos is a Google cloud-hosted multi-cluster ingress controller for Anthos GKE clusters. Ingress for Anthos supports deploying shared load balancing resources across clusters and across regions enabling users to use a same load balancer with an anycast IP for applications running in a multi-cluster and multi-region topology. In simpler terms this allows users to place multiple GKE clusters located in different regions under one LoadBalancer. It’s a controller for the external HTTP(S) load balancer to provide ingress for traffic coming from the internet across one or more clusters by programming the external HTTP(S) load balancer using network endpoint groups (NEGs).

Read more

What we learned after a year of GitLab.com on Kubernetes

It’s been one year since we moved GitLab.com to Kubernetes. We unpack the challenges and learnings from this major migration. For about a year now, the infrastructure department has been working on migrating all services that run on GitLab.com to Kubernetes. The effort has not been without challenges, not only with moving services to Kubernetes but also managing a hybrid deployment during the transition. We have learned a number of lessons along the way that we will explore in this post.

Read more

Introducing Domain-Oriented Microservice Architecture

Recently there has been substantial discussion around the downsides of service oriented architectures and microservice architectures in particular. While only a few years ago, many people readily adopted microservice architectures due to the numerous benefits they provide such as flexibility in the form of independent deployments, clear ownership, improvements in system stability, and better separation of concerns, in recent years people have begun to decry microservices for their tendency to greatly increase complexity, sometimes making even trivial features difficult to build.

Read more

Cloudflare outage on July 17, 2020

Today a configuration error in our backbone network caused an outage for Internet properties and Cloudflare services that lasted 27 minutes. We saw traffic drop by about 50% across our network. Because of the architecture of our backbone this outage didn’t affect the entire Cloudflare network and was localized to certain geographies. The outage occurred because, while working on an unrelated issue with a segment of the backbone from Newark to Chicago, our network engineering team updated the configuration on a router in Atlanta to alleviate congestion.

Read more

GNU GCC does not round floating-point divisions to the nearest value

I know that floating-point arithmetic is a bit crazy on modern computers. For example, floating-point numbers are not associative: 0.1+(0.2+0.3) = = 0.599999999999999978 (0.1+0.2)+0.3 = = 0.600000000000000089 But, at least, this is fairly consistent in my experience. You should simply not assume fancy properties like associativity to work in the real world. Source: lemire.me