This site is the archived OWASP Foundation Wiki and is no longer accepting Account Requests.
To view the new OWASP Foundation website, please visit https://owasp.org
Difference between revisions of "Input Validation Cheat Sheet"
Line 15: | Line 15: | ||
== Goal of Output Encoding == | == Goal of Output Encoding == | ||
Output encoding is to ensure that data is sanitised before being displayed to the user | Output encoding is to ensure that data is sanitised before being displayed to the user | ||
+ | |||
+ | == Goal of securing file uploads == | ||
+ | Files uploaded to servers are secured to ensure that malware / auto-executables / OS configuration changing files etc are not uploaded to servers that can impact the confidentiality, integrity and availability of the data stored on the server or other servers. | ||
== White List Input Validation == | == White List Input Validation == | ||
Line 117: | Line 120: | ||
* In addition to the existing input validation, define a positive approach which escapes/encodes characters that can be interpreted as xml. At a minimum this includes the following: < > " ' & | * In addition to the existing input validation, define a positive approach which escapes/encodes characters that can be interpreted as xml. At a minimum this includes the following: < > " ' & | ||
* If accepting raw XML then more robust validation is necessary. This can be complex. Please contact the infrastructure security team for additional discussion | * If accepting raw XML then more robust validation is necessary. This can be complex. Please contact the infrastructure security team for additional discussion | ||
+ | |||
+ | = File Uploads = | ||
+ | ==Upload Verification== | ||
+ | *Use input validation to ensure the uploaded filename uses an expected extension type | ||
+ | *Ensure the uploaded file is not larger than a defined maximum file size | ||
+ | |||
+ | ==Upload Storage== | ||
+ | *Use a new filename to store the file on the OS. Do not use any user controlled text for this filename or for the temporary filename. | ||
+ | *Store all user uploaded files on a separate domain (e.g. mozillafiles.net vs mozilla.org). Archives should be analyzed for malicious content (anti-malware, static analysis, etc) | ||
+ | |||
+ | ==Public Serving of Uploaded Content== | ||
+ | *Ensure the image is served with the correct content-type (e.g. image/jpeg, application/x-xpinstall) | ||
+ | |||
+ | ==Beware of "special" files== | ||
+ | * The upload feature should be using a whitelist approach to only allow specific file types and extensions. However, it is important to be aware of the following file types that, if allowed, could result in security vulnerabilities. | ||
+ | *"crossdomain.xml" allows cross-domain data loading in Flash, Java and Silverlight. If permitted on sites with authentication this can permit cross-domain data theft and CSRF attacks. Note this can get pretty complicated depending on the specific plugin version in question, so its best to just prohibit files named "crossdomain.xml" or "clientaccesspolicy.xml". | ||
+ | *".htaccess" and ".htpasswd" provides server configuration options on a per-directory basis, and should not be permitted. See http://en.wikipedia.org/wiki/Htaccess | ||
+ | |||
+ | ==Upload Verification== | ||
+ | *Use image rewriting libraries to verify the image is valid and to strip away extraneous content. | ||
+ | *Set the extension of the stored image to be a valid image extension based on the detected content type of the image from image processing (e.g. do not just trust the header from the upload). | ||
+ | *Ensure the detected content type of the image is within a list of defined image types (jpg, png, etc) | ||
Revision as of 15:32, 8 January 2016
Last revision (mm/dd/yy): 01/8/2016 Introduction [hide]
This article is focused on providing clear, simple, actionable guidance for providing Input Validation security functionality in your applications. Goal of Input ValidationInput validation is performed to minimize malformed data from entering the system. Input Validation is NOT the primary method of preventing XSS, SQL Injection. These are covered in output encoding and related cheat sheets. Goal of Output EncodingOutput encoding is to ensure that data is sanitised before being displayed to the user Goal of securing file uploadsFiles uploaded to servers are secured to ensure that malware / auto-executables / OS configuration changing files etc are not uploaded to servers that can impact the confidentiality, integrity and availability of the data stored on the server or other servers. White List Input ValidationIt is always recommended to prevent attacks as early as possible in the processing of the user’s (attacker's) request. Input validation can be used to detect unauthorized input before it is processed by the application. Developers frequently perform black list validation in order to try to detect attack characters and patterns like the ' character, the string 1=1, or the <script> tag, but this is a massively flawed approach as it is typically trivial for an attacker to avoid getting caught by such filters. Plus, such filters frequently prevent authorized input, like O'Brian, when the ' character is being filtered out. White list validation is appropriate for all input fields provided by the user. White list validation involves defining exactly what IS authorized, and by definition, everything else is not authorized. If it's well structured data, like dates, social security numbers, zip codes, e-mail addresses, etc. then the developer should be able to define a very strong validation pattern, usually based on regular expressions, for validating such input. If the input field comes from a fixed set of options, like a drop down list or radio buttons, then the input needs to match exactly one of the values offered to the user in the first place. The most difficult fields to validate are so called 'free text' fields, like blog entries. However, even those types of fields can be validated to some degree, you can at least exclude all non-printable characters, and define a maximum size for the input field. Developing regular expressions can be complicated, and is well beyond the scope of this cheat sheet. There are lots of resources on the internet about how to write regular expressions, including: http://www.regular-expressions.info/ and the OWASP Validation Regex Repository. The following provides a few examples of ‘white list’ style regular expressions: White List Regular Expression ExamplesValidating a Zip Code (5 digits plus optional -4) ^\d{5}(-\d{4})?$ Validating U.S. State Selection From a Drop-Down Menu ^(AA|AE|AP|AL|AK|AS|AZ|AR|CA|CO|CT|DE|DC|FM|FL|GA|GU| HI|ID|IL|IN|IA|KS|KY|LA|ME|MH|MD|MA|MI|MN|MS|MO|MT|NE| NV|NH|NJ|NM|NY|NC|ND|MP|OH|OK|OR|PW|PA|PR|RI|SC|SD|TN| TX|UT|VT|VI|VA|WA|WV|WI|WY)$
Example validating the parameter “zip” using a regular expression. private static final Pattern zipPattern = Pattern.compile("^\d{5}(-\d{4})?$"); public void doPost( HttpServletRequest request, HttpServletResponse response) { try { String zipCode = request.getParameter( "zip" ); if ( !zipPattern.matcher( zipCode ).matches() { throw new YourValidationException( "Improper zipcode format." ); } .. do what you want here, after its been validated .. } catch(YourValidationException e ) { response.sendError( response.SC_BAD_REQUEST, e.getMessage() ); } } Some white list validators have also been predefined in various open source packages that you can leverage. For example:
Client Side vs Server Side ValidationBe aware that any JavaScript input validation performed on the client can be bypassed by an attacker that disables JavaScript or uses a Web Proxy. Ensure that any input validation performed on the client is also performed on the server. Positive ApproachThe variations of attacks are enormous. Use regular expressions to define what is good and then deny the input if anything else is received. In other words, we want to use the approach "Accept Known Good" instead of "Reject Known Bad" Example A field accepts a username. A good regex would be to verify that the data consists of the following [0-9a-zA-Z]{3,10}. The data is rejected if it doesn't match. A bad approach would be to build a list of malicious strings and then just verify that the username does not contain the bad string. This approach begs the question, did you think of all possible bad strings? Robust Use of Input ValidationAll data received from the user should be treated as malicious and verified before using within the application. This includes the following
Input ValidationData recieved from the user should be validated for the following factors as well: 1. Boundary conditions (Out of range values) 2. Length of the data inputed (for example, if the input control can accept only 8 character, the same should be validated while accepting the data. The input chars should not exceed 8 characters). Validating Rich User ContentIt is very difficult to validate rich content submitted by a user. Consider more formal approaches such as HTML Purifier (PHP), AntiSamy or bleach (Python) Preventing XSS and Content Security Policy
Detailed information on XSS prevention here: OWASP XSS Prevention Cheat Sheet Output EncodingPreventing SQL Injection
Further Reading: SQL Injection Prevention Cheat Sheet Preventing OS Injection
Further Reading: Reviewing Code for OS Injection Preventing XML Injection
File UploadsUpload Verification
Upload Storage
Public Serving of Uploaded Content
Beware of "special" files
Upload Verification
Authors and Primary EditorsDave Wichers - dave.wichers [at] aspectsecurity.com Other Cheatsheets |