Revision as of 22:33, 17 November 2015

1 DRAFT CHEAT SHEET - WORK IN PROGRESS
2 Introduction
3 What is Deserialization?
4 Guidance on Deserializing Objects Safely
5 Language-Agnostic Methods for Deserializing Safely
- 5.1 Using Alternative Data Formats
- 5.2 Only Deserialize Signed Data
6 References
7 Authors and Primary Editors
- 7.1 Other Cheatsheets

DRAFT CHEAT SHEET - WORK IN PROGRESS

Introduction

This article is focused on providing clear, simple, actionable guidance for safely deserializing untrusted data in your applications.

What is Deserialization?

Serialization is the process of turning some object into a data format that can be restored later. People often serialize objects in order to save them to storage, or to send as part of communications. Deserialization is the reverse of that process -- taking data structured from some format, and rebuilding it into an object. Today, the most popular data format for serializing data is JSON. Before that, it was XML.

However, many programming languages offer a native capability for serializing objects. These native formats usually offer more features than JSON or XML, including customizability of the serialization process. Unfortunately, the features of these native deserialization mechanisms can be repurposed for malicious effect when operating on untrusted data. Attacks against deserializers have been found to allow denial-of-service, access control, and remote code execution attacks.

Guidance on Deserializing Objects Safely

The following language-specific guidance attempts to enumerate safe methodologies for deserializing data that can't be trusted.

Java

The following techniques are all good for preventing attacks against deserialization against Java's Serializable format.

Implementation: In your code, override the ObjectInputStream#resolveClass() method to prevent arbitrary classes from being deserialized. This safe behavior can be wrapped in a library like SerialKiller. Implementation: Use a safe replacement for the generic readObject() method as seen here. Note that this addresses "billion laughs" type attacks by checking input length and number of objects deserialized. Implementation: Use a Java agent to override the internals of ObjectInputStream to prevent exploitation of known dangerous types as seen in rO0 and NotSoSerial

Prevent Data Leakage

If there are members of the object graph that should never be controlled by end users during deserialization or exposed to users during serialization, they should be marked with the transient field.

Prevent Deserialization of Domain Objects

To guarantee that your application objects can't be deserialized, a readObject() should be declared (with a final modifier) which always throws an exception.

private final void readObject(ObjectInputStream in) throws java.io.IOException {
   throw new java.io.IOException("Cannot be deserialized");
}

Override ObjectInputStream#resolveClass()

Python

Ruby

Language-Agnostic Methods for Deserializing Safely

Using Alternative Data Formats

A great reduction of risk is achieved by avoiding native deserialization formats. By switching to a pure data format like JSON or XML, you lessen the chance of custom deserialization logic from being repurposed into malicious.

Many applications rely on a data-transfer object pattern that involves creating separate objects domain of objects for the explicit purpose data transfer. Of course, it's still possible that the application will make security mistakes after a pure data object is parsed.

Only Deserialize Signed Data

If the application knows before deserialization which messages will need to be processed, they could sign them as part of the serialization process. The application could then to choose not to deserialize any message which didn't have an authenticated signature.

References

Deserialization of untrusted data

[1] [2]

Authors and Primary Editors

Arshan Dabirsiaghi - arshan [at] contrastsecurity dot org

Other Cheatsheets

OWASP Cheat Sheets Project Homepage

OWASP Cheat Sheet Series

V - T - E Cheat Sheets
Developer / Builder	3rd Party Javascript Management Access Control AJAX Security Cheat Sheet Authentication (ES) Bean Validation Cheat Sheet Choosing and Using Security Questions Clickjacking Defense Credential Stuffing Prevention Cheat Sheet Cross-Site Request Forgery (CSRF) Prevention Cryptographic Storage C-Based Toolchain Hardening CSS Security Deserialization DOM based XSS Prevention Forgot Password HTML5 Security HTTP Strict Transport Security Injection Prevention Cheat Sheet Injection Prevention Cheat Sheet in Java JSON Web Token (JWT) Cheat Sheet for Java Input Validation Insecure Direct Object Reference Prevention JAAS Key Management LDAP Injection Prevention Logging Mass Assignment Cheat Sheet .NET Security OS Command Injection Defense Cheat Sheet OWASP Top Ten Password Storage Pinning Query Parameterization REST Security Ruby on Rails Session Management SAML Security SQL Injection Prevention Transaction Authorization Transport Layer Protection TLS Cipher String Configuration Unvalidated Redirects and Forwards User Privacy Protection Web Service Security XSS (Cross Site Scripting) Prevention XML External Entity (XXE) Prevention Cheat Sheet
Assessment / Breaker	Attack Surface Analysis REST Assessment Web Application Security Testing XML Security Cheat Sheet XSS Filter Evasion
Mobile	Android Testing IOS Developer Mobile Jailbreaking
OpSec / Defender	Virtual Patching Vulnerability Disclosure
Draft and Beta	Application Security Architecture Business Logic Security Content Security Policy Denial of Service Cheat Sheet Grails Secure Code Review IOS Application Security Testing PHP Security Regular Expression Security Cheatsheet Secure Coding Secure SDLC Threat Modeling
All Pages In This Category

[[Category:Cheatsheets]]

@@ Line 8: / Line 8: @@
 Serialization is the process of turning some object into a data format that can be restored later. People often serialize objects in order to save them to storage, or to send as part of communications. Deserialization is the reverse of that process -- taking data structured from some format, and rebuilding it into an object. Today, the most popular data format for serializing data is JSON. Before that, it was XML.
-Many programming languages offer a native capability for serializing their objects. These native formats usually offer more features than JSON or XML, including customizability of the serialization process. Unfortunately, the features of these native deserialization mechanisms can be repurposed for malicious effect when operating on untrusted data. Attacks against deserializers have been found to allow denial-of-service, access control, and remote code execution attacks.
+However, many programming languages offer a native capability for serializing objects. These native formats usually offer more features than JSON or XML, including customizability of the serialization process. Unfortunately, the features of these native deserialization mechanisms can be repurposed for malicious effect when operating on untrusted data. Attacks against deserializers have been found to allow denial-of-service, access control, and remote code execution attacks.
 =Guidance on Deserializing Objects Safely=
 The following language-specific guidance attempts to enumerate safe methodologies for deserializing data that can't be trusted.
 ==Java==
-The following techniques are all good for preventing attacks against deserialization.
+The following techniques are all good for preventing attacks against deserialization against [http://docs.oracle.com/javase/7/docs/api/java/io/Serializable.html Java's Serializable format].
-* Use the signing features of a language to assure that deserialized data has not been tainted.
-Implementation: When deserializing data, populate a new object rather than just deserializing. The result is that the data flows through safe input validation and that the functions are safe.
-Implementation: Explicitly define final readObject() to prevent deserialization.
-An example of this is:
-private final void readObject(ObjectInputStream in)
-throws java.io.IOException {
-     throw new java.io.IOException("Cannot be deserialized");
-}
-Implementation: Make fields transient to protect them from deserialization.
 Implementation: In your code, override the ObjectInputStream#resolveClass() method to prevent arbitrary classes from being deserialized. This safe behavior can be wrapped in a library like SerialKiller.
 Implementation: Use a safe replacement for the generic readObject() method as seen here. Note that this addresses "billion laughs" type attacks by checking input length and number of objects deserialized.
 Implementation: Use a Java agent to override the internals of ObjectInputStream to prevent exploitation of known dangerous types as seen in rO0 and NotSoSerial
+===Prevent Data Leakage===
+If there are members of the object graph that should never be controlled by end users during deserialization or exposed to users during serialization, they should be marked with [https://docs.oracle.com/javase/7/docs/platform/serialization/spec/serial-arch.html#6250 the <code>transient</code> field].
+===Prevent Deserialization of Domain Objects===
+To guarantee that your application objects can't be deserialized, a <code>readObject()</code> should be declared (with a <code>final</code> modifier) which always throws an exception.
+<pre>private final void readObject(ObjectInputStream in) throws java.io.IOException {
+   throw new java.io.IOException("Cannot be deserialized");
+}</pre>
+===Override ObjectInputStream#resolveClass()===
 ==Python==
@@ Line 33: / Line 40: @@
 ==Ruby==
+= Language-Agnostic Methods for Deserializing Safely =
+==Using Alternative Data Formats==
+A great reduction of risk is achieved by avoiding native deserialization formats. By switching to a pure data format like JSON or XML, you lessen the chance of custom deserialization logic from being repurposed into malicious.
+Many applications rely on a [https://en.wikipedia.org/wiki/Data_transfer_object data-transfer object pattern] that involves creating separate objects domain of objects for the explicit purpose data transfer. Of course, it's still possible that the application will make security mistakes after a pure data object is parsed.
+==Only Deserialize Signed Data==
+If the application knows before deserialization which messages will need to be processed, they could sign them as part of the serialization process. The application could then to choose not to deserialize any message which didn't have an authenticated signature.
 = References =

Difference between revisions of "Deserialization Cheat Sheet"

Revision as of 22:33, 17 November 2015

DRAFT CHEAT SHEET - WORK IN PROGRESS

Introduction

What is Deserialization?

Guidance on Deserializing Objects Safely

Java

Prevent Data Leakage

Prevent Deserialization of Domain Objects

Override ObjectInputStream#resolveClass()

Python

Ruby

Language-Agnostic Methods for Deserializing Safely

Using Alternative Data Formats

Only Deserialize Signed Data

References

Authors and Primary Editors

Other Cheatsheets

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Reference

Tools