Category:OWASP Learn About Encoding Project

Revision as of 22:06, 12 September 2009 by Federico.casani (talk | contribs)




Starting with projects such as overtime

The "OWASP Learn About Encoding Project" has not discovered anything new; rather, it wants to emphasize the importance of input sanitization and output escaping. On the web, pages are often displayed incorrectly: you see question marks (?) where accented letters should be, or strange characters (e.g. A with tilde, A with umlaut) where the euro sign should be, and so on. Moreover, some communication channels exchange characters without proper checks: SMS messages, chat messages, VoIP clients, etc. often contain inconsistent values.
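The display errors described above can be reproduced in a few lines. The following is a minimal sketch, not code from the project: the class name `MojibakeDemo` and its helper methods are hypothetical, and it assumes the text was written as UTF-8 and then read back with a different charset. String literals use Unicode escapes so that the example does not depend on the encoding of the source file itself.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of the project's code base.
class MojibakeDemo {

    // Encode text as UTF-8, then (wrongly) decode the bytes as ISO-8859-1.
    // Each byte of a multi-byte UTF-8 sequence becomes its own character.
    static String decodeUtf8AsLatin1(String text) {
        byte[] utf8 = text.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, StandardCharsets.ISO_8859_1);
    }

    // Encode text as US-ASCII and decode it again. Characters that ASCII
    // cannot represent are replaced by the encoder with '?'.
    static String forceThroughAscii(String text) {
        byte[] ascii = text.getBytes(StandardCharsets.US_ASCII);
        return new String(ascii, StandardCharsets.US_ASCII);
    }

    public static void main(String[] args) {
        String eGrave = "\u00E8"; // e with grave accent

        // UTF-8 bytes C3 A8 read as ISO-8859-1 give U+00C3 U+00A8:
        // an A with tilde followed by a diaeresis.
        System.out.println(decodeUtf8AsLatin1(eGrave));

        // The accented letter is lost and a question mark appears instead.
        System.out.println(forceThroughAscii(eGrave));
    }
}
```

What the console actually shows also depends on the terminal's own charset, which is one more place where the same mismatch can occur.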

Using the proper charset is essential for:

  • data integrity: if we take some characters as input, we want to see the same characters in the output
  • preventing canonicalization problems: understanding charsets is the first step
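Both points can be illustrated with the standard Java library. This is a minimal sketch under stated assumptions: the class `CharsetIntegrityDemo` and its methods are hypothetical names, and Unicode normalization to NFC (via `java.text.Normalizer`) is used here as just one example of canonicalization, not as the project's own method.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.text.Normalizer;

// Hypothetical demo class, not part of the project's code base.
class CharsetIntegrityDemo {

    // Encode and decode with the same charset. The result equals the input
    // only if the charset can represent every character in the text.
    static String roundTrip(String text, Charset charset) {
        return new String(text.getBytes(charset), charset);
    }

    // Bring text to one canonical form (NFC) before comparing or validating.
    static String canonicalize(String text) {
        return Normalizer.normalize(text, Normalizer.Form.NFC);
    }

    // Compare two strings after canonicalization.
    static boolean sameText(String a, String b) {
        return canonicalize(a).equals(canonicalize(b));
    }

    public static void main(String[] args) {
        String euro = "\u20AC"; // euro sign

        // Integrity: UTF-8 preserves the euro sign, ISO-8859-1 cannot.
        System.out.println(roundTrip(euro, StandardCharsets.UTF_8).equals(euro));      // true
        System.out.println(roundTrip(euro, StandardCharsets.ISO_8859_1).equals(euro)); // false

        // Canonicalization: the same visible letter, two code point sequences.
        String composed = "\u00E9";    // e with acute accent, one code point
        String decomposed = "e\u0301"; // letter e plus combining acute accent
        System.out.println(composed.equals(decomposed));   // false
        System.out.println(sameText(composed, decomposed)); // true
    }
}
```

Comparing or filtering input before it has been canonicalized is what lets two different byte sequences for the same text slip past a check, which is why canonicalization is applied first in this sketch.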


This project aims to educate developers, systems analysts and anyone else who writes code about the proper use of charsets and about canonicalization. It seeks to give a comprehensive answer by walking through the most common scenarios and highlighting the roles of the key players (browser, operating system, database, etc.).

To achieve this goal we decided to create a tool in three different formats:

  • web application
  • swing application
  • shell tool


Why do I need to understand encoding?
Why do I need to understand charsets?
Some wiki documents are available here.



Detailed roadmap for future developments:

01/03/09 : Startup

01/03/09 - 31/03/09 : Project idea definition

01/04/09 - 15/04/09 : Proof of concept

16/04/09 - 30/04/09 : Project goal definition

01/07/09 : Code Development

01/12/09 : Alpha release


Code is hosted by Google Code here
You can download the source code from here.

Project Information

Project Name OWASP Learn About Encoding Project
Short Project Description

The ultimate goal of this project is to demystify the problems related to character encoding (charset encoding). From the proper use of charsets to the issue of canonicalization, we will try to explain and resolve the problems around a topic that matters a great deal to ICT professionals. The project consists of a web application that explains the character life cycle, plus a usable text-based tool and a GUI tool.

Key Project Information

Project Leaders
Federico Casani
Andrea Zonzin

Project Contributors
(if any)

Mailing List
Subscribe here
Use here

Creative Commons Attribution Share Alike 3.0

Project Type

add link(s)

Release Status Main Links Related Projects

Alpha Quality
Please see here for complete information.

Blog if any, add link(s)


A Java security improvement


The project does not have any contributors yet. If you want to become a contributor, start from the mailing list:
Subscribe here
Use here

Content license: Creative Commons 3.0 BY-SA - Code license: GNU Lesser General Public License
