Cloud Computing
Cloud Computing | News, how-tos, features, reviews, and videos
Do you need to repatriate from the cloud?
Repatriation is one route to cost savings. Switching development patterns from long-running services to WebAssembly-powered serverless functions is another.
Snowflake’s open-source Arctic LLM to take on Llama 3, Grok, Mistral, and DBRX
Arctic will be available under the Apache 2.0 license and can be accessed via Snowflake Cortex for serverless inference or across providers such as AWS, Azure, Nvidia, Perplexity, and Together AI.
AWS moves Amazon Bedrock’s AI guardrails, and other features to general availability
The updates include new large language models and a capability to import custom models, which is currently in preview.
The cloud is not a slam dunk platform for generative AI
With public cloud providers chasing generative AI, it may be a surprise when dollars flow in other directions. Vendors and customers have a lot to consider.
The dawn of intelligent and automated data orchestration
Enterprise workflows desperately need what iPhone and Android users have enjoyed for years—ready access to files wherever and whenever they’re needed, regardless of where the files are physically located.
7 innovative ways to use low-code tools and platforms
Low-code platforms aren't just for web forms and simple integrations anymore. Here are seven innovative ways small and large enterprises are stretching the limits of what low-code can do.
AWS Snowmobile drives into the sunset
Customer demand for more efficient methods and faster online data transfer capabilities have led AWS to exit the data trucking business.
Cloud cost management is not working
As cloud costs got out of control, we turned to cost optimization approaches and tools, and have little to show for it. What now?
OpenAI's Assistants API gets a boost
The OpenAI Assistants API, used to build AI assistants, has been updated with faster and expanded file search, vector stores, and a new tool choice parameter.
Qdrant offers managed vector database for hybrid clouds
Qdrant Hybrid Cloud is based on the open-source Qdrant vector similarity search engine and vector database written in Rust.
3 secrets to deploying LLMs on cloud platforms
Let’s not make the same mistakes we did 10 years ago. It is possible to deploy large language models in the cloud more cost-effectively and with less risk.
Better application networking and security with CAKES
How the CAKES stack, centered on Kubernetes, addresses API, networking, security, and compliance challenges while speeding up delivery and lowering costs.
The cloud is benefiting IT, but not business
A recent study shows that the cloud benefits the IT department more than other business areas. That’s not enough to make it a success.
Six key takeaways from Google Cloud Next ’24
Generative AI was the dominant theme at Google Cloud Next ’24, as Google rolled out new chips, software updates for AI workloads, updates to LLMs, and generative AI-based assistants for its machine learning platform Vertex AI.
Google unveils open source projects for generative AI
Google introduced an LLM inference engine, a library of reference diffusion models, and TPU optimizations for transformer models at Google Cloud Next ’24.
Google updates Vertex AI with new LLM capabilities, agent builder feature
Other updates include grounding applications and virtual agents in Google Search via Vertex AI and Vertex AI agent builder.
Google adds Gemini to databases to aid faster code development, migration
Gemini's availability across Google Cloud database offerings is expected to help developers code and migrate faster than Duet AI, which was integrated last year.
Google’s Gemini Cloud Assist helps manage cloud apps
AI-powered assistant for Google Cloud can help design, deploy, and configure apps, troubleshoot issues, and optimize performance and costs.