Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

range based for loop for objects #83

Closed
nickdesaulniers opened this issue Jun 5, 2015 · 32 comments
Closed

range based for loop for objects #83

nickdesaulniers opened this issue Jun 5, 2015 · 32 comments

Comments

@nickdesaulniers
Copy link

Note that the readme recommends this syntax for iterating key/value pairs of an object:

for (json::iterator it = o.begin(); it != o.end(); ++it) {
  std::cout << it.key() << " : " << it.value() << "\n";
}

If we try to iterate an nlohmann::json object using a range based for, we only seem to get the value:

// clang++ example.cpp -std=c++11 -I lib/
#include "json.hpp"
int main () {
  using nlohmann::json;
  json j = json::parse("{ \"hello\": \"world\", \"name\": \"Nick\" }");
  for (json::iterator it = j.begin(); it != j.end(); ++it) {
    std::cout << it.key() << " : " << it.value() << "\n";
  }
  for (auto pair : j) {
    std::cout << pair << "\n";
    // std::cout << pair.first << " : " << pair.second << "\n";
  }

  std::map<std::string, std::string> map { {"hello", "world"}, {"name", "Nick"} };
  for (auto& pair: map) {
    std::cout << pair.first << " : " << pair.second << "\n";
    //std::cout << pair << "\n";
  }
}

prints:

hello : "world"
name : "Nick"
"world"
"Nick"
hello : world
name : Nick

Using some fancyness, it should be possible to provide an API similar to std::map et al for iterating key value pairs of an object.

@nlohmann
Copy link
Owner

nlohmann commented Jun 7, 2015

I understand what you mean. I would need to overwork the iterators - I'll check how much effort this would mean. One aspect that needs to be addressed is the fact that the same approach needs to be applicable likewise to arrays. There, it would be cumbersome to access the value with pair.second. What do you think?

@nickdesaulniers
Copy link
Author

the same approach needs to be applicable likewise to arrays. There, it would be cumbersome to access the value with pair.second. What do you think?

Yeah, it would be preferable to not have to access the value of array elements through a std::pair. Do note that in JavaScript, arrays are objects with numeric unsigned monotonically increasing keys. I'm not sure how much of that applies to JSON; it might be weird for the array iterator to be:

first second
0 "hello"
1 "world"
2 "foo"

ex.

Object.keys([0, 1, 2]) // Array [ "0", "1", "2" ]

@mlund
Copy link

mlund commented Jun 11, 2015

This is essentially what I was after in issue #67 and would be happy to see map-like behavior for range based loops.

@nlohmann
Copy link
Owner

Hi @nickdesaulniers and @mlund, I thought about the issue and came to the following idea I would like to discuss with you:

I learned the types of the objects that are created in a range-based for are determined by the return value of iterator::operator*(). There, I currently return reference which is a typedef for basic_json &. To allow the access to the key during iteration, I would replace this type by a wrapper class like this:

class basic_json_wrapper
{
  public:
    // create wrapper objects given a key and a value
    inline basic_json_wrapper(const string_t& key, basic_json& value)
        : m_key(key), m_value(value)
    {}

    // implicit type conversion to basic_json
    template <class V, typename
              std::enable_if<
                  std::is_convertible<basic_json, V>::value, int>::type
              = 0>
    inline operator V & ()
    {
        return m_value;
    }

    // explicit access to the key
    inline string_t& key()
    {
        return m_key;
    }

    // explicit access to the value
    inline basic_json& value()
    {
        return m_value;
    }

  private:
    // the key (object key or array index)
    string_t m_key = "";
    // the basic_json value
    basic_json& m_value;
};

This class would behave just like a basic_json object (via implicit conversion, but also by forwarding other calls to the member m_value), but have the same member functions key() and value() the "normal" iterator classes have.

For objects, I would set m_key to the object's key. For arrays, I would create a string of the array index. For all other JSON types, the key would be empty.

The code above is not complete yet - I would need to extend the basic_json_wrapper class to have the same public member functions like basic_json (is there an elegant way to do this in C++?). Nevertheless, I would like to hear your opinion about this. Personally, I would prefer this approach to a simple std::pair<std::string, basic_json>, because then I would be forced to always use it.second to access the value even though the JSON value I iterate is an array. What do you think?

The code above would allow an example like this:

#include "src/json.hpp"

using nlohmann::json;

int main()
{
    json foo = {{"one", 1}, {"two", 2}};

    for (auto i : foo) {
        std::cerr << i << std::endl;
        std::cerr << i.key() << std::endl;
        std::cerr << i.value() << std::endl;
    }

    // print: 1, 2
    // print: "one", "two"
    // print: 1, 2

    json bar = {"one", "two", "three"};
    for (auto i : bar) {
        std::cerr << i << std::endl;
        std::cerr << i.key() << std::endl;
        std::cerr << i.value() << std::endl;
    }

    // print: "one", "two", "three"
    // print: "0", "1", "2"
    // print: "one", "two", "three"
}

@nickdesaulniers
Copy link
Author

lgtm, but to be consistent with std::map I would prefer i.first and i.second.

@mlund
Copy link

mlund commented Jun 12, 2015

Both have merits; I'm personally in favour of the wrapper due to the cumbersome pair for arrays. std-like behaviour is appealing, though, and I wonder if one could return a std::pair-derivative for arrays that can be implicitly converted into the data type std::pair::T2, i.e. int or string in the examples above?
That is, something like this:

for (auto &i : bar) {
    string a = i; // "one" ...
    string b = i.second; // "one" ...
}

The derived pair could be implemented like this:

template<class T1, class T2, class base=std::pair<T1,T2> >
struct mypair : public base {
    mypair(T1 a, T2 b) : base(a,b) {}
    operator T2() const { return base::second; }
};
int main()
{
    mypair<string,float> v("hej", 1.0);
    cout << v << endl; // -> 1.0
}

@nlohmann
Copy link
Owner

I currently see whether I can use an approach similar to std::vector<bool>::reference. Once I found a way to return a proxy object, I still can add members first and second to mimic the std::pair approach.

@kepkin
Copy link

kepkin commented Jun 17, 2015

Neils, I would really appreciate if you check my iterator reimplementation before doing your changes with iterators, if you didn't begin. Otherwise I may have hard time to merge my changes in #88. If you want I may create a separate pull request just for iterators.

@nlohmann
Copy link
Owner

Hi @kepkin, sorry for letting you wait. I did not find the time to look at your PR yet. I really appreciate any work that helps us getting closer to a MSVC build. But since I would like to avoid too many conditionals or other "hacks", I really would like to take some time investigating. I haven't made any changes with respect to this issue yet, and I will not commit anything related to iterators before I checked your code. Right now, I hope to be able to do this on the weekend.

@nlohmann nlohmann added this to the Release 3.0 RC milestone Jul 1, 2015
@nlohmann
Copy link
Owner

I checked the implementation of the vector<bool>::reference and think that approach is not really helpful. A simple way to be able to access key() in a range base for could be this one: http://stackoverflow.com/a/14920606/266378: It creates a wrapper which allows to access the iterator rather than the elements themselves. What do you think?

@nlohmann
Copy link
Owner

Alright, here would be my implementation:

/*!
@brief wrapper to access iterator member functions in range-based for

This class allows to access @ref key() and @ref value() during range-based
for loops. In these loops, a reference to the JSON values is returned, so
there is no access to the underlying iterator.
*/
class iterator_wrapper
{
  private:
    /// the container to iterate
    basic_json& container;
    /// the type of the iterator to use while iteration
    using json_iterator = decltype(container.begin());

    /// internal iterator wrapper
    class iterator_wrapper_internal
    {
      private:
        /// the iterator
        json_iterator anchor;
        /// an index for arrays
        size_t array_index = 0;
        size_t container_size = 0;

        /// calculate a key for the iterator
        std::string calculate_key()
        {
            array_index++;

            switch (anchor.m_object->type())
            {
                /// use integer array index as key
                case (value_t::array):
                {
                    return std::to_string(array_index - 1);
                }

                /// use key from the object
                case (value_t::object):
                {
                    return anchor.key();
                }

                /// use an empty key for all primitive types
                default:
                {
                    return "";
                }
            }
        }

      public:
        /// construct wrapper given an iterator
        iterator_wrapper_internal(json_iterator i, size_t s)
            : anchor(i), container_size(s), first(calculate_key()), second(*i)
        {}

        /// dereference operator (needed for range-based for)
        iterator_wrapper_internal operator*()
        {
            return *this;
        }

        /// increment operator (needed for range-based for)
        iterator_wrapper_internal operator++()
        {
            ++anchor;

            first = calculate_key();

            if (array_index < container_size - 1)
            {
                second = *anchor;
            }

            return *this;
        }

        /// inequality operator (needed for range-based for)
        bool operator!= (const iterator_wrapper_internal& o)
        {
            return anchor != o.anchor;
        }

        /// stream operator
        friend std::ostream& operator<<(std::ostream& o, const iterator_wrapper_internal& w)
        {
            return o << w.value();
        }

        /// return key of the iterator
        typename basic_json::string_t key() const
        {
            return first;
        }

        /// return value of the iterator
        basic_json value() const
        {
            return second;
        }

        /// public member to simulate std::map::iterator access
        typename basic_json::string_t first;
        /// public member to simulate std::map::iterator access
        basic_json& second;
    };

  public:
    /// construct iterator wrapper from a container
    iterator_wrapper(basic_json& cont)
        : container(cont)
    {}

    /// return iterator begin (needed for range-based for)
    iterator_wrapper_internal begin()
    {
        return iterator_wrapper_internal(container.begin(), container.size());
    }

    /// return iterator end (needed for range-based for)
    iterator_wrapper_internal end()
    {
        return iterator_wrapper_internal(container.end(), container.size());
    }
};

To use the wrapper, consider the following example code:

#include <json.hpp>
using nlohmann::json;

int main()
{
    json j_object = {{"one", 1}, {"two", 2}, {"three", 3}};
    json j_array = {"foo", "bar", "baz"};

    for (auto i : json::iterator_wrapper(j_object))
    {
        std::cout << "i.key() = " << i.key() << ", ";
        std::cout << "i.first = " << i.first << '\n';
        std::cout << "i.value() = " << i.value() << ", ";
        std::cout << "i.second = " << i.second << ", ";
        std::cout << "*i = " << *i << ", ";
        std::cout << "i = " << i << '\n';
    }

    for (auto i : json::iterator_wrapper(j_array))
    {
        std::cout << "i.key() = " << i.key() << ", ";
        std::cout << "i.first = " << i.first << '\n';
        std::cout << "i.value() = " << i.value() << ", ";
        std::cout << "i.second = " << i.second << ", ";
        std::cout << "*i = " << *i << ", ";
        std::cout << "i = " << i << '\n';
    }
}

The output is

i.key() = one, i.first = one
i.value() = 1, i.second = 1, *i = 1, i = 1
i.key() = three, i.first = three
i.value() = 1, i.second = 1, *i = 1, i = 1
i.key() = two, i.first = two
i.value() = 1, i.second = 1, *i = 1, i = 1
i.key() = 0, i.first = 0
i.value() = "foo", i.second = "foo", *i = "foo", i = "foo"
i.key() = 1, i.first = 1
i.value() = "foo", i.second = "foo", *i = "foo", i = "foo"
i.key() = 2, i.first = 2
i.value() = "foo", i.second = "foo", *i = "foo", i = "foo"

I would be happy for feedback, @nickdesaulniers @mlund @kepkin.

@nickdesaulniers
Copy link
Author

I think something is wrong with the output? The value of the second and third element should be "bar" and "baz" respectively. Assuming simple copy+paste error, this looks great! Nice job!

Maybe a separate issue we can discuss in another ticket; keys in JSON are not sorted, but ordered based on definition. This becomes apparent when iterating keys of an object.

@nlohmann
Copy link
Owner

Hi @nickdesaulniers,

thanks for checking by. In fact, there was still a bug in the code. The output should, of course, read:

i.key() = one, i.first = one
i.value() = 1, i.second = 1, *i = 1, i = 1
i.key() = three, i.first = three
i.value() = 3, i.second = 3, *i = 3, i = 3
i.key() = two, i.first = two
i.value() = 2, i.second = 2, *i = 2, i = 2
i.key() = 0, i.first = 0
i.value() = "foo", i.second = "foo", *i = "foo", i = "foo"
i.key() = 1, i.first = 1
i.value() = "bar", i.second = "bar", *i = "bar", i = "bar"
i.key() = 2, i.first = 2
i.value() = "baz", i.second = "baz", *i = "baz", i = "baz"

If no-one objects, I'll push this to master tomorrow.

About the ordering of keys: please open an issue.

@mlund
Copy link

mlund commented Jul 27, 2015

Thanks for this — it looks great.
/ Mikael

On 27 Jul 2015, at 21:17, Niels [email protected] wrote:

Hi @nickdesaulniers https://github.com/nickdesaulniers,

thanks for checking by. In fact, there was still a bug in the code. The output should, of course, read:

i.key() = one, i.first = one
i.value() = 1, i.second = 1, *i = 1, i = 1
i.key() = three, i.first = three
i.value() = 3, i.second = 3, *i = 3, i = 3
i.key() = two, i.first = two
i.value() = 2, i.second = 2, *i = 2, i = 2
i.key() = 0, i.first = 0
i.value() = "foo", i.second = "foo", *i = "foo", i = "foo"
i.key() = 1, i.first = 1
i.value() = "bar", i.second = "bar", *i = "bar", i = "bar"
i.key() = 2, i.first = 2
i.value() = "baz", i.second = "baz", *i = "baz", i = "baz"
If no-one objects, I'll push this to master tomorrow.

About the ordering of keys: please open an issue.


Reply to this email directly or view it on GitHub #83 (comment).

@nickdesaulniers
Copy link
Author

🚀 🎸 🍸 🌴

nlohmann added a commit that referenced this issue Jul 30, 2015
nlohmann added a commit that referenced this issue Aug 6, 2015
@nlohmann
Copy link
Owner

nlohmann commented Aug 6, 2015

There is currently still an issue with the code for the wrapper: The previous builds on Travis all failed due to a segmentation fault.

@vanderlokken
Copy link

Hi. Do you have any plans to add a version of iterator_wrapper which can be constructed from a constant reference?

And by the way in my opinion this name, 'iterator_wrapper', is so-so since it's hard to guess what exactly this thing does. One could expect that an instance of this class can be constructed from an iterator, but it can't. I'd call this class 'member_iterator_range' or something like this.

nlohmann added a commit that referenced this issue Dec 22, 2015
@nlohmann
Copy link
Owner

Hi @vanderlokken, the iterator_wrapper (I'm not happy about the name either...) now also works for const objects. It is now realized by a static function, but a member function should also be possible. This could allow for code like

for (auto v : j.values())
{
}

Any better idea?

@EvilPudding
Copy link

Why not the javascript way?

for( var prop in obj )
{
    var p = obj[prop];
}
for( auto prop: obj )
{
    json p = obj[prop]
}

This would mean the iterator only returns the keys,
and that the lookup for the value would have to be
done inside the loop, which has performance problems
i'm guessing.

It would also make it so the syntax for both objects
and arrays could be the same.

Also note, that this method allows you to have both
the value and the key.

This could also work without removing any of the other
features, namely, having the .values() only iterate
values, but having the default iterator be the keys.

@RElesgoe
Copy link

@EvilPudding That looks very unnatural in C++. IMO, it should simply be:

for (auto obj : json)
    std::cout << obj.key() << ":" << obj.value() << std::endl;

@EvilPudding
Copy link

@xboi209 What would be the type of that obj? If
you want to keep natural to C++, then you should
only use the map standard iterator, and access
the key with .first and the value with .second.

C++ programmers are familiar with iterating through
a map, so it wouldn't be a stretch.

@RElesgoe
Copy link

@EvilPudding The type of obj would be a custom type defined by this library, but using the map container could work as well.

@EvilPudding
Copy link

@xboi209 I would hate it if it was done, but
it would work like you're saying, if we were to
extend the map so instead of pairs it takes a
costume pair extension that has key and value
as aliases.

The problem with this solution is that we would
be leaving standard maps, and other C++
programmers would assume we're just using
maps and be lead to false conclusions.

@nlohmann
Copy link
Owner

I tried to mimic the STL as close as possible. Therefore, JSON objects shall behave as stl::map, and keys/values are available as first/second.

However, I don't like how maps behave with range fors, therefore I added the wrapper.

@gregmarr
Copy link
Contributor

What part of the map range for behavior do you not like?

@nlohmann
Copy link
Owner

@gregmarr If I do a range for over a JSON object, I only get a reference to the stored object, but have no way to access the key. This is different to when I use iterators explicitly, because I overloaded the iterators with a key function.

@gregmarr
Copy link
Contributor

Oh, I thought you meant you didn't like std::map behavior, not json object behavior.

@nlohmann
Copy link
Owner

Ok - all that is left open in this issue would be the name of the wrapper. Does anyone has better ideas?

@nlohmann nlohmann added the state: please discuss please discuss the issue or vote for your favorite option label Dec 28, 2015
@nlohmann nlohmann removed this from the Release 1.0.0-rc1 milestone Dec 28, 2015
@nlohmann nlohmann removed the state: please discuss please discuss the issue or vote for your favorite option label Jan 14, 2016
@keithel
Copy link

keithel commented Sep 14, 2017

Is this issue now defunct and obsolete? I see it's closed - and I don't see the last proposed api available in Release version 2.1.1.

I'm trying to simply iterate over the key-value pairs at the leaf-end of a json document -- such that it's simply simple key to basic type value pairs. I can get the values just by doing a range-based for loop, but seems I have no way to get the associated key.

    for (auto config_setting : mode_config)
    {
        std::cout << "Value " << config_setting.front() << std::endl;
    }

The above works for getting the value, but there seems to be no key accessor...

@nlohmann
Copy link
Owner

It is called json::iterator_wrapper() and used like this:

for (auto it : json::iterator_wrapper(mode_config))
{
    std::cout << "key: " << it.key() << ", value: " << it.value() << std::endl;
}

@keithel
Copy link

keithel commented Sep 14, 2017

Ok! Thank you! That makes sense. I had ended up using classic STL iterators to access it but the range-based for is cleaner and easier to understand.

@theShmoo
Copy link
Contributor

theShmoo commented Oct 4, 2021

For those who find this PR:

The way to go is now:

for (auto it : mode_config.items())
{
    std::cout << "key: " << it.key() << ", value: " << it.value() << std::endl;
}

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

10 participants